FIGURE l 



.GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCT'GGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
T CATACAT TAC T GC AGTAACACT CCAC CATATAGAC CC G GC T T T ACCT T AT AX CAGT GACACT GG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
CTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAAGAGAACGTTATCATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TAT AT AT G T T T GT T CAGACCATC C T T T C CTACCAAATGCAGCCCAAAAT CC AT GGC AAAC AAGTC 



TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
|f| ATCAGTTTTGCACAGTGGCAATTTTGGGACTGATTTAGAACAGAAACTCCATTGGAACCCCGAGG 
til ACAAAGGTTATGTGCT TCACATGAT CACTACTGCAGCAGAATGGTC TATGTCATT TTCCT TC TT T 

GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
TGGATTAACCCTCTATGACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCA 
GAGATAT T TGA T GAAAGGATAAAAT AT T T C T G TAAT GAT TATGAT TCT C AGG GAT TGGGGAAAGG 
m TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 
jJl GAT GAAT G C T GAT AAT CAGGAAAC AT G AAAGAAGC C AT T T GAT AG AT T AT T C T AAAGG AT AT CAT 

O CAAGAAGACTATTAAAAACACCTATGCCTATACTTTTTTATCTCAGAAAATAAAGTCAAAAGACT 

m 

5 



□ 



m 
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FIGURE 2 

<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

mwwfqqglsflpsalviwtsaa.fi fsyitavtlhhidpalpyi sdtgtvapekclfgamlniaav 
lc i at i yvr ykqvhals peemv 1 1 klnkaglvlg i ls clgl s ivan fqktt l faah vs gavl t fg 
mgslymfvqtilsyqmqpkihgkqvfwirlllviwcgvsalsmltcssvlhsgnfgtdleqklhw 

NPE DKGYVLHM I T T AAE WSMS FSFFGFFLTYIRDFQKIS LRVEANLHGL T L Y DT AP C P I NNE RT R 
LLSRDI 



f '""*'■ Important features: 

**** Type II transmembrane domain: 

CI 

H! amino acids 13-33 

W 

f jf Other Transmembrane domains : 

In 

amino acids 54-73, 94-113, 160-180, 122-141 



m N-rnyristoylation sites. 

m amino acids 57-63, 95-101, 99-105, 124-130, 183-189 

1 

fii 
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FIGURE 3 



CGGACGCGTGGGCGGACGCGT GGGGGAGAGCCGCAGT CCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGftGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCGCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTT T CAT GGTGCCTTT TTAC ATT GGCTATTTTATTGT GAGCAATATCCGAC TACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
CAT T CT CAGCCCAAAACAT GGGATCTTAT CCAT AGAACAGCTCATCAGCCGGGTTGGTG T GATT GGAG 
Q TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
Q CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 
CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCA.GAAGGGGGAAGTGCATAAC^ 
AACCATCAGGT TTCTGGGGAA.TGATAAAAAGTGTTAC CACTTCAGCATCAGGAAGTGAAAATCTTACT 
CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 
ATATGCT ACCAAGGAGAGAATAGAATACTCCAAAACC T TCAAGGGGAAATATTTTAATTT T CTT GGTT 
ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 



If! 
ill 

m 

m 



Q GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 
1=11 GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAA.TCATCGTCACATCCATCAGAGGAT 



O 

HI 
M 

fll 



TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 

CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 

AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 

ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 

GAGAAGCAAATGGCACCT TGAA CTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 

GATATAAGAGGGGGGAAAAATGGAZICCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 

ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 

AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG 

TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 

CTATGGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTG 

CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAA^ 

ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE d. 



MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGV 
LNSSSRYFHWKMNLCVILLILVFWPFYIGYFIVSNIRLLHKQRLLFSCLLWLTEKYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAW^ 

TMDMIISKKKR^1AMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGI QFDVKFWS QHI S FI LVG 1 1 1 VTS I RGLL I TLTKFFYAI S S SKS SNVI VLLLAQ IMGMY 
FVSS VLL I RMSMPLE YRT 1 1 TE VLGEL Q FNFYHRW FD V I FLVSAL S S I L FLYLAHKQAPEKQMAP 

Important features: 
Signal peptide: 



** 

s 

n\ amino acids 1-23 

■fa.* 

Ml 

if' Potential transmembrane domains: 



amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 



Jjl N-glycosylation sites. 

U amino acids 67-70, 180-183 and 243-246 

m 

6 ' 

'Jfs Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
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FIGURE s 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAATTATGGGGATCACCCTTGTGAGCAAAAAGGCGAACCAGCAGCTGAATTTCACAG 
AAGC TAAGGAGGC CT GTAGGC T GCT GGGAC TAAGT T TGGCCGGCAAGGACCAAGT T GAAACAGC C 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAG T T T GCAGCC TAT TGT TACAAC TCATC TGAT AC TT GGAC TAACT CGTGC AT TCCAGAA 
AT TATCACCACCAAAGATCCCATAT TCAACAC TCAAACTGCAACACAAACAACAGAAT TTATTGT 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
C T GC T CC AGCT T CC ACT TC TAT T CCACG GAGAAAAAAAT T GATT TG T G T C ACAGAAGT T T TT AT G 
GAAACT AGCACCAT GT CTACAGAAACT GAACCATTTGTTGAAAATAAAGCAGCATTCAAGAAT GA 
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
C AGCTGG TC T TGGAT T T T G C TAT G T CAAAAGG TAT GT GAAGGC C TTCCCTTT TACAAA.CAAGAAT 
CAGCAGAAGGAAAT GATCGAAACCAAAGTAGTAAAGGAGGAGAAGGC CAAT GATAGCAACCCTAA 
TGAGGAATCAAAGAAAACTGATAAAAACCCAGAAGAGTCCAAGAGTCCAAGCAAAACTACCGTGC 
GATGCCTGGAAGCT GAAGT TTAGAT GAGACAGAAATGAGGAGACACACCT GAGGCTGGTTTCT T T 



hi - 
;g CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 

IP CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT 

ill GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 

TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 

CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 

\_ AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 

CI GCT CT GAAAGAGAA&CACGTAT C C CACC TGAC ATGT C C T T C TGAG CC CGG T AAG AGCAAAAGAAT 

y| GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA 

m GCTAAAATAAAGAAAmGAACAAGGCTGAGGATACGACAGTACACTGTCAGCAGGGACTGTAAAC 

$fi ACAGACAGGGTCAAAGTGT T TTCTCT GAACACATTGAGTTGGAATCACT GTT TAGAACACACACA 

I;; CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 

|3 AAAAT AAAAA.C T C T T AT AAAT T T C TAT T T T T A T C T GAG T T AC AG AAAT GAT T AC T AAG GAAG AT T 

f|| AC TCAG TAAT T T GT TT AAAAAGT AATAAAAT TCAACAAAC AT T T GCT GAAT AGC TACT ATAT G T C 

AAGTGC TGT GCAAG G TAT T ACACT C T GT AAT T GAAT AT TAT T CC T CAAAAAAT T GCAC AT AGT AG 

AACGCT ATCT GGGAAGC T ATT TT T T T CAGT T T TGATAT T T C TAGC T T AT C T ACT TCCAAACT AAT 

TTTTATTTTTGCTGAGACTAATCTTATTCATTTTCTCTAATATGGCAACCATTATAACCTTAATT 

TAT TATTAACATAC C TAAGAAGT ACATTGT TACCTCTATATACCAAAGCACATTTTAAAA3TGCC 

AT TAACAAATG T AT CAC T AGCCC TC CT T T TT CCAACAAGAAGGGAC T GAGAGATGCAGAAATAT T 

TGT G AC AAAAAAT T AAAGC AT T T AGAAAAC T T 
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FIGURE 6 



m 



MARCFS LVLLLTS I WT TRLLVQGS LRAEEL S I QVSCRIMG I TLVSKKANQQLN FTEAKEACRLLG 
LSLAGKDQVETALKAS FETCSYGWVGDGFWISRI SPNPKCGKNGVGVL I WKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVFMETSTMSTETEPFV^ 

RYVKAFPFTNKNQQKEMIETKWKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 



p! Transmembrane domain: 

amino acids 235-254 

m 

|i N-glycosylation site. 

H Q amino acids 53-57, 130-134, 289-293 

I 

% Casein kinase II phosphorylation site. 

Pi 

Y«l amino acids 145-149, 214-218 

fit 

p| Tyrosine kinase phosphorylation site. 

Q amino acids 79-88 



N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTA 
AAGCATCATCAGAAGTGAACCTGGCAAACTTACCTCCCAGCTATCACAATGAGACCAACACAGAC 
ACGAAGGT TGGAAATAATAC C AT CCATG TGCACC GAGAAAT T C ACAAGATAACCAACAAC CAG AC 
T GGACAAATGGT CT TT TCAGAGACAGT TATCACATCTGTGGGAGACGAAGAAGGCAGAAGGAGCC 
ACGAGTGCATCATCGACGAGGACT GTGGGCCCAGCATGTACTGCCAGT T TGCCAGCT TC CAGTAC 
. t ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
H GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
K| ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
Q ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
p| CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
SI GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
.'^ CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
CCGCCGCTGC ACTGC T GGGAGGGGAAGAGAT T TAGAT CTGGACCAGGC T G T GGGTAGATGTGCAA 
? &f TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
? CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
Q CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
Lff GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
p| TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
T T TTCT TGAGGCAT GCACAT CTGGAATTAAGGTCAAACTAATTCTCACATCCCTC TAAAAGTAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
AT T GAC ACTGT C CC T C T T TGGCAG T T GCAT TAGTAAC T T T GAAAG GT AT AT GACT GAGCGT AGCA 
T AC AGG T T AACC TGCAGAAAC AGTAC T TAGGTAAT T G T AGGGCGAGGAT T AT AAAT GAAAT T TGC 
AAAAT CACT TAGCAG CAAC T GAAGACAAT TATCAACCACGTGGAGAAAAT CAAACC GAGCAGGGC 
T G T G T GAAAC AT GGT T G TAAT AT GCG AC TGCGAACAC T GAACT CTAC GC CAC T CC ACAAATGAT G 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CATGATTGTATAAGCATGCTTTCTTTGAGTTTTAAATTATGTATAAACATAAGTTGCATTTAGAA 
ATC AAGCATAAATCACTTCAACT GCAAAAAAAAAAAAAAAAAAAAAAAAAA 



m 

m 
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FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPWPGPALSYPQEELATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKAS SEVTSILANLPP S YHNETNTDTKVGNNT I HVHRE IHKI TNNQTGQMVFSE 
TVI TS VGDEEGRRSHECI I DEDCG PSMYCQFAS FQYTCQPCRGQRMLCTRDSECCGDQLCWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHS LVYVCKPT FVGSRDQDGE I LLPRE VPDE YEVGS FMEEVRQE LEDLE 
RSLTE EMALGE PAAAAAALLGGEE I 



Signal sequence: 

I : amino acids 1-19 

1^1 N-glycosylation site. 

#J amino acids 96-100, 106-110, 121-125, 204-208 

m 

i f% Casein kinase II phosphorylation site. 

*|| amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 

n 327-331 

Gi . 

in 

D 
ft! 

Ill Amidation site. 

amino acids 140-144 



N-myristoylation site. 

amino acids 202-208, 217-223 
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FIGURE q 



CGGACGCGTGGGCGGACGCGTGGGQGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATAT T GAC CACAAGC TCAGAGGAG TGCAGAAC C C TGTAGCCCGCT G CAT CATGTGCTGTTT 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATGATGA 
H TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 



ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 



««-r. GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
y ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
Pi GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 



GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGATTCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGG 
*?* CCCTGATCCAGGACTGCACCGCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
jjf CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
TTGAGAGGCTGAGGCGGGCGGATGACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GAT CGCGCCACT GCACTCCAACCT GGGT GACAGAC TCTGTCTC CAAAAC AAAACAAACAAACAAA 
AAGAT T T TATTAAAGATAT T T TGT T AAC T C 
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FIGURE lo 



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
l^VIJ^GQCVIAGAFASFyWAFHKPQDIPTFPLISAFIRTLRYHTGSLAFGALILTLVQIARVIL 
E Y I DHKLRGVQNPVARC IMCC FKCCL WCLEKF I KFLNRNAY IMI A I YGKNFCVS AKNAFMLLMRN 
I WVVVLDKVTDLLLFFGKLL WGGVGVLS FFFFSGRIPGLGKDFKSPHLNYYWLPIMTS I LGAY 
V I AS GFFS VFGMCVDTL FLC FLE DLERNNGS LDRPYYMSKSLLK I LGKKNE APPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 



u 

£3 

Q N-glycosylation sites. 

f|! amino acids 16-20, 27-31, 289-293 

lil 

m 
m 



Hypothetical YBR002c family proteins. 

amino acids 27 6-288 



|3 Ammonium transporters proteins. 

Ul amino acids 204-231 

III 

pi N-myristoylation sites. 

HI amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 



AppJD=10063699 



Page 164 of 322 



FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
Q TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
M 5 CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 

W 



CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
O AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
U1 GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 



SI 



ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 



MGACLGACSLLSCASCLCGSAPCI LCSCCPASRNSTVSRLI FTFFLFLGVLVS IIMLSPGVESQL 
YKLPWVCEEGAGIPTVLQGHTDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLS IAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVS IAAVLPKV 
QDAQPNS GLL QAS VI T LYTMFVTW SALS S I PEQKCNPHLPTQLGNE TWAGPEGYETQWWDAPS I 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQOQVAACEGRAFDNEQDGVTYSY 
SFFHFCLVXiASLHVMMTLTNWYKPGETRK^ 
FS 

Signal sequence : 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-173, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 

I 
% 

i 
$ 

3 
1 
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FIGURE 13 



CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGT TGGAAAAAGAC T C C T G T AAC C C T C C T C C AGGATGAACC ACC TGCCAGAAGACAT GGAGAAC G 
CTCTCACCGGGAGCC^GAGCTCCCA.TGCTTCTCTGCGCAATATCCATTCCATCAAGCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCATT GAGAACACATT AGAGAAGGAGGTGAT GCAGTAT GACT ACTATTCTT CATAT 
T TTGATATATTTCTTCTGGCAGT T T T T CGAT TTAAAG TG TTAATACT TGCATATGCTGTGTGCAG 
ACT GCGCCAT T GGT GGGCAATAGCGT T GACAAC GG C AGT GACC AGT GCCT T T T TAC TAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
El CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA 



CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG 
GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT 



D 

m 

id 

||| G AGAAACC ACT TT T AGAAC T ATGAGTAC TAC T T T T GT T AAATGTGAAAAAC CCTCAC AGAAAG T C 

$U ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 

>?W C ACTGC T GGC TT TAT T GAACAGC TAAT AAAGATT TAT T TAT TG TAAT ACCT CACAAACGT TGTAC 

l s CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 

i'h TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 

Q AAAAGACT TAAT AT AT T GAAG TAAC AC T T T T T TAG T AAGC AAGAT AC CT T T T TAT T T CAAT T CAC 

|ll AG AATGGAATT TTTTTGTTT CAT G T C T CAGAT T TAT T T T GTAT T T CT T T T T TAACACT CT AC AT T 

fcl TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 

' ftl 

f *■* ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CTTGCAAAAGGGGAAGAAAGGAATTGCGAATACATGTAAAATGTCACCAGACATTTGTATTATTT 
T TAT CAT GAAAT CAT GTTTTTCTCT GAT T G T TC TGAAAT GT TC T AAATAC T CT TAT TT T GAATGC 
ACAAAATGACTTAAACCATTCATATCATGTTTCCTTTGCGTTCAGCCAATTTCAATTAAAATGAA 
CTAAAT T AAAAA 
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FIGURE 14 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQ^ 

VTLLWI IELNVNGGI ENTLEKEVMQYDYYS S Y FD I FLLAVFRFKVL I IAYAVCRLRHWWAI ALTT 
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFIIAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AAL I PGGLS DGQFYS PPESEAGSEEAEEKQDSEKPLLEL 



Important features of the protein: 
Signal peptide: 

amino acids 1-20 



fcl Transmembrane domains: 

W amino acids 54-72, 100-118, 130-144, 146-166 

91 

ill 

p| N-myristoylation sites, 

g| amino acids 14-20, 78-84, 79-85, 202-208, 217-223 

ill 
# 

O 

If! 

O 

m 
•i?<?.% 

it 
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FIGURE is 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
CTTCCCGCCGAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 



!5SS 



m GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 



GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
f!l. ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCTGTGTGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGC 
T T T AGGAACAT GTTTTGCTTTTT T AAAATAT ATA TAT T T AT AAGAGA TC C T T T CC C AT T TAT TC T 
GGGAAGATGTTTTTCAAACTCAGAGACAAGGACTTTGGTTTTTGTAAGACAAACGATGATATGAA 
GGC C T T T TGTAAGAAAAAATAAAAGATGAAGTGT GAAA 
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FIGURE 16 

MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSW FGPWVRESHVTLASPEE 
TRCHFPPPCNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
t f 4 TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 
%£ LLAALAAVGAAY CVRRGRAMAAAAQDKG QVGP GAG P LE LE GVKVPLEPGPKATEGGGEAL PS GSE 

|;J CEVPLMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 57 9-599 

EGF-like domain cysteine pattern signature, 
amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-rnyri s toyla ti on sites. 

amino acids 23-29, 27-33, 70-76, 142-143, 187-193, 348-354, 
594-600, 640-646 



>r1; 



m 
m 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAATCCTTAGATTCCAAGACTACTTTGACATCAGATGAGTCAGTA 
AAGGACCATACTAC TGCAGGCAGAGTA.GT TGCTGGTCAAATATTTCT TGATTCAGAAGAATCTGA 
ATTAGAATCCTCTATTCAAGAAGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAG 
AAGATAT CAGCT T T GT AGAGTCT CCAAAT CCAGAAAACAAGGAC TAT GAAGAGCCAAAGAAAG TA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
T T T CCTAGATAAGGAGTATGATGAATGT ACATCAGATGGGAGGGAAGAT GGCAGACT GTGGT GT G 
CTACAACCTATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCT 
AAGAGACGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCTTAATGGAAG 
C AATAAGAAAAGC C AAAAAAG AGAAG CAT AT C G G TAT C T C CAAAAGG CAG C AAGCAT GAACC AT A 
CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATCCAG 
L&f GCAGCGAGAGAGATGTT TGAGAAGC TGACTGAGGAAGGCT C T CCCAAGGGACAGACTGCT CT TGG 

CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTTTAGTGGAAGGCT 



B 
m 



m 



m 
rii 



AATAATATTAACATCAGAAGAATTTGTGGTTTATAGCGGCCACAACTTTTTCAGCTTTCATGATC 



CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT 
|fl AACACATGGAAT CT ACAT G T AAAT GAAAGT T GGT GGAGTC CACAAT TT T T C T T TAAAATGAT TAG 

T TT GGCT GAT T GCC CC TAAAAAGAGAGAT C T GATAAAT GGC TC T T T T T AAAT T T T C T C T GAGT TG 
GAATTGTCAGAATCATTTTTTACATTAGATTATCATAATTTTAAAAATTTTTCTTTAGTTTTTCA 
AAAT T T T G TAAAT G G T GGC T AT AG AAAAACAAC AT GAAAT AT T AT AC AAT AT T T T G C AAC AAT G C 
CCTAAGAATTGTTAAAATTCATGGAGTTATTTGTGCAGAATGACTCCAGAGAGCTCTACTTTCTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT T AAC T CAT T T T TAAT AAAAT TAT GT CT AAGAT T AAAAAAAAAAAAAAA?UIAA?^AAAAAAA 
AAAi^\AAAAAAAAAAAAAAAAAAAAAA?U\AAAAAAAAAAAAAAAA^ 
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FIGURE 18 



Iff 

P 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRVVAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYK^J)EKWGFCETEEEAAKRRQMQEAEmYQTG^ILNGSNKKSQKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQKIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
SSQAKALVYYTFGALGGNLIAHMVLVSRL 

Important features: 
Signal peptide: 

amino acids 1-21 



M 

|y N-glycosylation sites. 

£fl amino acids 195-199, 217-221, 272-276 

ty 

PS* 



Tyrosine kinase phosphorylation site. 

amino acids 220-228 



m 

n 

Q N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 



Glycosaminoglycan attachment site. 



ZU amino acids 267-271 

III 



Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen -binding domain protein. 

amino acids 127-169 



Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE iq 



AAT T CAGAT T T T AAGCCCATTC T G C AGT GGAAT T T CAT GAAC TAGCAAGAGGAC AC CAT C T T C T T 
GT ATTATACAAGAAAGGAGTG TACC TATC ACACACAGGGGGAAAA ATGC TCTT T1 GGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACAT T T T TAT CAC T GGAT GTGACTCGGGC T T T G G AAACT T GGC AGCCAGAAC T T T T GAT 
AAAAAGGG AT T TCAT GT AATCGCT GC CT GT C T GAC T GAAT CAGGAT CAACAG C T T TAAAGGC AGA 
AACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAGAGAATGTC^GAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
C CC GGCGTGCTGG CT C CCACTGAC T GGC TGACAC TAGAGGACTACAGAGAACC TAT TGAAG T G AA 
CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 
Q TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
hit T AT G CAG T GGAAG GT T T CAATGACAG C TTAAGAC G GGACAT GAAAGC T T T T G GT GT GC AC G T C T C 

7'; ATGCAT TGAACCAGGAT T GTTCAAAACAAACT T GGCAGAT C CAGTAAAGG T AAT T GAAAAAAAAC 

\M 

T C GC CA T T T G G GAGC AG C T GT C T C C AG ACAT CAAACAAC AAT AT G GAGAAGG T T AC AT T G AAAAA 

l?M 

Jh AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 
LLl CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 
* T T T TC TGGAT AC C T C T G T C TC AC AT GCC AGCAGC T T TGCAAGAC T T T T TAT T GT T GAAAC AGAAA 

GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 
%l AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
go CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
B CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
fll GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 

TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 

AT C AT C T C T TAT C T AAAT AT TAAAAGATAAGTCAAC C C AAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAA 
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FIGURE 20 



MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALPCAETSERLRTVLLDVTDPENVKRTAQWKNQVGEKGLWGLINNAGVPGVLAPTDWLTLEDY 
RE P I EVHL FGL I S VT LNML PL VKKAQGRVI NVS S VG GRLAI VGGG YT P S KYAVE G FND S LRRDMK 
AFGVHVS C I E PGL FKTNLADP VKVI EKKLAI WE QLS PD IKQQYGEGYI EKS LDKLKGNKS Y VNMD 
LS P WECMDEALT SLFPKTHYAAGKDAKI FWI PLSHMPAALQDFLLLKQKAELANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 



ffl amino acids 136-152 



N-glycosylation sites* 



"'SI. amino acids 161-163, 187-190 and 253-256 

gj| Glycosaminoglycan attachment site. 

U! amino acids 39-42 

m 



N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GG T GAAGCC AAG AAC AGCAT TACT GAT T CC CAAAT GGAT GAT G T T GAAGT T G T T TATACAAT T GA 
CATTCAGAAATATATTCCATGCTATCAGCTTTTTAGCTTTTATAATTCTTGAGGCGAAGTAAATG 
AGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGTTGGTACAAATTCCGT 
CGTCATTCAGATCAGATCATGACGTTTAGAGAGAGGCTGCTTCACAAAAACTTGCAGGAGCATTT 
T T CAAACCAAGACC T TGTTTTTC T GC T AT TAACACC AAGT AT AAT AACAGAAAGCT GC T C TACT C 
ATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTTTTTCACAGGGTACCTTTAGTGGTT 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
TTT T AG C CGAG C AG T ACAAAC AC AC AGC T C T AAAT T T T T T GAAGAAG AT GGAT CC T T AAAG GAG G 
TACATAAGATA&AT GAAATGTAT GCT T CAT TACAAGAGGAAT TAAAGAGTATATGCAAAAAAGTG 
|f| GAAGACAG TGAACAAGCAG TAGATAAACT AGTAAAGGAT GT AAAC AGAT TAAAACGAGAAAT TGA 

id GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 
ACAT TTTTCTTTGT CAGGC AT T ACGGACCTTTT TTC CAAAT TC TGAAT T TC T T CAT T CAT G T G T T 
ATGTC T T TAAftAAATAGACATGT TT C T AAAAGTAGCTGTAACTACAACCACCATCTCGAT G TAGT 
AGACAAT CT GAC C T T AAT G G TAGAACACAC T G AC AT TC C T GAAGC TAGT CC AGCTAGTAC ACCAC 
Q AAATCATTAAGCATAAAGCCTTAGACTTAGATGACAGATGGCAATTCAAGAGATCTCGGTTGTTA 

ill GATACACAAGACAAACGATC TAAAGCAAATAC TGGT AGTAGTAACCAAGATAAAGCATCCAAAAT 

f*\ 

»jg GAGCAGCCCAGAAACAGATGAAGAAAT T GAAAAGATGAAGGGTT TTGG T GAATAT TCACGG T C T C 

fi\ 

C TACAT T T TGAT C C T T T T AACCT T ACAAGG AGAT TTTTTTATTTGGCT GATG GG TAAAGC CAAAC 
p| AT TTC TAT T G T T T T TAC TATG T TGAG C TAC T TGCAGTAAGT TCAT T T GT T T T T ACT AT GT T CAC C 

TGT T TGCAGTAAT ACACAGATAACT CT TAGTGCAT TTACTTCACAAAG TACT TTTTCAAACATCA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAATCCCAGCACT 
T AGGGAAGACAAGT CAGGAGGAT TGAT TGAAGCTAGGAG T TAGAGAC CAGCCTGGGCAAC G TAT T 
GAGACCATGTC TAT TAAAAAATAAAAT G GAAAAGCAAGAATAGCCT TAT T TT CAAAATATG GAA& 
GAAATTTATATGAAAATTTATCTGAGTCATTAAAATTCTCCTTAAGTGATACTTTTTTAGAAGTA 
CAT TAT GGC TAG AGT T GCCAGAT AAAAT GC T GGATATC AT GCAATAAAT T T GCAAAACAT CAT C T 
AAAATTTAAAAAAASAAAAAAAAAAAAAA 



fa 

n. 



m 
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FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVSWYTIDIQKYI 
PCYQLFSFYNSSGEVNEQALKKILSNWKNW^ 

VFL LL T P S 1 1 TE S C S T HRLE H S L YKP QKGL FHRVPL WANL GMS E QL G YKT VS G S CMS T G FS RAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRPIVSKSSCNYNHHLDWDNLTL 
MVF1HTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASPMSSPET 
DEEIEKMKGFGEYSRSPTF 



Important features: 
Signal peptide: 

CI 

.p'| ammo acids 1-19 

1 

J-Mj N-glycosylation sites. 

amino acids 75-79, 322-326 

%|! 

amino acids 184-154 

J'fl 

pn 



N-royristoylation site. 



Growth factor and cytokines receptors family. 



S amino acids 134-150 
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FIGURE 23 



GGCACAGCCGCGCGGCGGAGGGCA^GTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 
CAAGAGAT GGAGGAGAAGT ACAGCCT GCAAAGT AAAGAT GATTT CAAAAGT GT GGTCAG C GAACAGT GCAAT CATTTG 

t . CAAGCTGTCTTTGCTTC^CGTTACAAGAAGTTTGATGAATTCTTCAAAGAACTACTTGAAAATGCAGAGAAATCCCTG 

M 

AJVTGATATGTTTGTGAAGACATATGGCCATTTATACATGCAAA^TTCTGAGCTATTTAAAGATCTCTTCGTAGAGTTG 

kJ 

AAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTGGGCTCGCCTCCTGGAGCGGATG 



''■■--*■ 

if! 
r\ 

Bi 
O 



TTCCGCCTGGTGAACTCCCAGTACCACTTTACAGATGAGTATCTGGAATGTGTGAGCAAGTATACGGAGCAGCTGAAG - 



m 

j". | CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
|J% TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 
ATGATCTACTGCTCCCACTGCCGGGGTCTCGTGACTGTGAAGCCATGTTACAACTACTGCTCAAACATCATGAGAGGC 
TGTTTGGCCAACCAA.GGGGATCTCGATTTTGAATGGAACAATTTCATAGATGCTATGCTGATGGTGGCAGAGAGGCTA 
GAGGGT CCTTT CAACATT GAAT C G GT CAT GGAT C CCATC GAT GT GAAGATTT CT GAT GC TATTAT G AACAT GCAGGAT 
AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGGAACGCCCAACCACAGCAGCTGGCACT 
AGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAACTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAAC 
GT TTGCAACGAT GAGAGGATGGCTGCAGGAAAC GGCAAT GAGGATGACT GTT GGAAT GG GAAAGG CAAAAGCAG GTAC 
CTGTTTGCAGTGACAGGAAATGGATTAGCCAACCAGGGCAACAACCCAGAGGTCCAGGTTGACACCAGCAAACCAGAC 
ATACTGATCCTTCGTCAAATCATGGCTCTTCGAGTGATGACCAGCAAGATGAAGAATGCATACAATGGGAACGACGTG 
GACTTCTTTGATATCAGTGATGAAAGTAGTGGAGAAGGAAGTGGAAGTGGCTGTGAGTATCAGCAGTGCCCTTCAGAG 
TT TGACTACAAT GCCACT GAC CATGCT GGGAAGAGT GCCAATGAGAAAGC C GACAGT GC TGGTGT CCGT CCT GGGGCA 
CAGGCCTACCTCCTCACTGTCTTCTGCATCTTGTTC^^ 

AAAAAGTGTTCATCAAAAAGTTAAAAGGCACCAGTTATCACTTTTCTACCATCCTAGT GACTTT GCTTTTTAAATGAA 
T G GACAACAATGTACAGTTTTTACTAT GTGGC CACTGGTTTAAGAAGT GCTGACTTT GTTTT CTCATTCAGTTTTGGG 
AGGAAAAGGGACTGTGCATTGAGTTGGTTCCTGCTCCCCCAAACCATGTTAAACGTGGCTAACAGTGTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
T CTGAAATAT TAAATAGCT GTACAGAAG CAGGTTTTATTT AT CAT GT TAT CTTATTAAAAGAAAAAG C C CAAAAAGC 
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FIGURE 24 



MARFGLPALLCT1AVLSAALIAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 
CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWGNTOLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTMFVAARTFAQGLAVAGDWSKVSVYNPTAQCTHALLKMIYCSHCRGL 
VT VKPC YNYC SN IMRGCLANQGDLDFEWNNFI DAMLMVAERLEGP FN I E S VMDP IDVKI S DAIMN 
MQDNS VQVSQKVFQGCGPPKPLPAGRI SRS I SE SAFSARFRPHHPE ERPTTAAGTS LDRLVT DVK 
EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KSANEKADS AGVRPGAQAYLLTVFC I LFLVMQRE"WR 

Important features: 
Signal peptide: 

amino acids 1-22 



ATP/GTP-binding site motif A (P-loop) 

Vp. amino acids 515-524 



N-glycosylation site. 



m amino acids 514-518 



f~| Glycosaminoglycan attachment sites. 

Ill amino acids 494-498, 498-502 



N-myri s toylation si tes . 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
T CAAGGGCAATGTGAAGAAAACAAGAC ACCAAAGGCACCACAGAAAGCCAAACAAGCAT TCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 
T T C TT CT CCC ACCT CACT CTCCCACT GTAC CCAC CC C T AAAT CAT T C CAG T GC T C TCAAAAAGC A 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCT AG T GT CAT TTAACCT TAAAT GCAAT CAGGAAAGTAGCAAACAGAAGT CAAT AAAT AT T T T T 



|f1 AAATGTCAAAAAAAAAAAAAAAAAA 



m 
II 
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FIGURE 26 

MKVLI S SLLLLLPLMLMSMVS S S LNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites, 

amino acids 27-33, 46-52 
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FIGURE 27 



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGC CATG GACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AG T T CC T GACAAAGAGC ATGGC T GAGAACAGGCACC T C CAATAT G AGCGG T T TG TGGTGGC T CCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG. 
CT CTGTGCAGAGCCCAAGGAAGGTC CT GCAGGAGG T C C GG AGAG TAC T GAG AC C GGGAGG T GT GC 
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
GAGCCCACCTGGAAACACAT TGGGGAT GGCTGC TGC CTCACCAGAGAGACCTGGAAGGAT CT TGA 
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
j A GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
fM T TCCCCAGCCTCCAAT TAGAACAAGCCACCCACCAGCC TATCTATC TT CCACTGAGAGGGACCTA 

ifl. 



GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC 
AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 



|1| CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 

0 CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 

ttl CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 

>f! AT G CC AGAGC AAG ACT C AAAGAGG C AGAGG T T T TG T TC TC AAATAT T T T T T AATAAAT AGAC GAA 

hi 

ACCACG 



App_ID=10063699 



Page 181 of 322 



FIGURE 28 

MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYIjMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAEmHLQYERFWAPGEDM 
RQLADG SMDVWCT LVXCS VQS PRKVLQEVRRVLRPGGVL FFWEHVAE P YGS WAFMWQQVFE P T W 
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 



m 



ifl 

HI 

I 



Important features; 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-my ri s toy! a t i on sites. 



yj amino acids 64-70, 78-84, 80-86, 91-97, 201-207 



fi 
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FIGURE 2Q 



CAATGTTTGCCTA.TCCACCTCCCCCAAGCCCCTTTACCT ATG CTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCT^ 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGT TGTT TAT GATT CT TATACTAATT TATACAAAGATAT TAAGGCCCTGTTCAT TAAGAAATT 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGT TAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 



m 



o 
m 
m 
m 

m 
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FIGURE 30 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 



Important features: 
Signal peptide: 

amino acids 1-15 



Growth factor and cytokines receptors family: 

amino acids 3-18 

H* 
P 
El 

If! 
lit 
# 

3 

p| 

if! 
13 

P 
III 
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FIGURE 31 



G T T T GAAT T C C T T C AAC TAT AC C C AC AG T C C AAAAGC AG AC T C AC T GTGT C CCAGGC TAC CAG T T 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
AT T T GCAT C T G T T T T GATAAATGAT GT T GAC ACC C T C CACC GAAT TC TAAGTGGAATCATGT CGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGT T T TATGGT GGCTGTATTAT GACTATACCAAC GACC TCAGCATAGAAT TG GACA 
CAGAAAGGGAAZUVTATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
C TGC T CGTCT TGAT TTTTGTTCT C AGAAAG AG AAT AAAAT T GACAGT T GAGCTT TT CCAAAT CAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 

u 

%^ TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
m ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 

l s 

m AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 

b| TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 

& 1 TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 

^ GATTCCGAGAAT CAT T GTC AT GT ACAT G CAAAAC GCACT GAAAGAAC AGCAGCATGGTGCAT T GT 

m 

CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
Q AACCAGAATGCATATAC T ACAACTGC TAT TAAT GGGAC AGAT T T CTGTACATCAGCAAAAGAT GC 

ill AT T CAAAATC T T GT CCAAGAAC T CAAGTCAC TT TACAT C TAT TAAC T GCT T TGGAGAC T T CAT AA 

M TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
p| AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
jEjjj CCATAGTTTTTTATCTGTGTTTGA7U\CTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
T GGAAACAAAT GAT G GATCGTCAGAAAAGCCC TACT T TAT G GATC AAGAAT T T C T GAG T T T C G T A 
AAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGAATGAGGA 
GGGAACAGAACT CC AGGCCATTGT GAGATAGATACCCATTTAGGTATCT GTACCTGGAAAACAT T 
TCCTTCTAAGAGCCATTTACAGAATAGAAGATGAGACCACTAGAGAAAAGTTAGTGAATTTTTTT 
■ TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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FIGURE 22 



m 



MS GRDT I LGLC I 1ALALS LAMMFT FRF ITT LLVH I FI S LV I LGLL FVCGVLWWLY YDYTNDLS I E 
LD T ERENMKC VL G FAI VS TGI T AVL LV L I FVLRKR I KLT VE L FQ I TNKA I S S AP FL L FQ P L WT FA 
I LI FFWVLWVAVLLS LGTAGAAQVMEGGQVE YKPLSG IRYMWS YHL I GL I WT S E F I LACQQMT I A 
GAWTCYFNRSKNDPPDHPILSSLSILFFYHQGTWKGSFLISWRIPRIIVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI IFLGKVLWCFTVFGGIKAFOTNRAFQWAVPLLLVAFFAYLVAHS FLSVFETVLDALFLCFA 
VDLE TNDGS S EKPYFMBQE FL S FVKRSNKLNNARAQQDKHS LRKEEGTE LQAI VR 



! B ' :k; Signal peptide: 

0 

m 



Important features: 
Signal peptide: 

amino acids 1-20 



Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 135-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myr i s toyl a ti on sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTGTGTGGT 
GAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTTATGAGGAC 
TGTTGTTCTCACTATGAAGGGATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
ATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTGCCTCAGATCAACTGC 
GAT G T C AAAGCC GGAAAGATC AT CGATC C TGAGT TC AT TG T GAAAT G T C CAGCAGGATGC CAAG A 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGT TACAAAGGGAGTTAT TCCAACGGTGTCCAATCGT TAT CCCTACCACGATGGAGAGAATCCTT 
TAT CGTC T T AGAAAG T AAACCCAAAAAG GGT G TAAC C T AC CCAT CAGCT C T TACATAC T CAT CAT 
CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTATTCCAGGG 
ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
GCCACAGGAGCCAGGAGATGGATCTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCC 
AGAGCTGATCCAGGTATCCAAAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGC 
GGATGT CAGC CT GGGACT TGT TCCAAAAGAAGAAT TGAGCACACAGT CTT TGGAGC CAG TAT CCC 
TGGGAGATCCAAACTGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAA 
CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 
ACACGAATTCTCGAGATCTGAAGACAGCCATAGAGAAAATTACTCAGAGAGGAGGACTTTCTAAT 
ffl GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
iQ GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACSAAGTGGAGGAGGCTTCAA 
J\ GACTTGCGAGAGAGTCAGGAATCAACATTTTCTTCATCACCATTGAAGGTGCTGCTGAAAATGAG 
AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
;; s , GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
l ?s l ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
111 GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
p AGAG T T T GAGAT T T CCGACAC GGACACG CGCATC GGGGCC G T GCAGT ACACCT ACGAACAGCGGC 

TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 
W . TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
£3 GTCC7\AGCCCAACAAJGAGGAAGTTAATGATCCTCATCACCGACGGGAGGTCCTACGACGACGTCC 
f|| GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 
GT TTGACAACCT CCAT CAGTATGTCCC CAGGATCATCCAGAACAT TT GT ACAGAGT TCAACT CAC 
AG C C T C GGAAC TGAAT T CAGAGC AGG C AGAGCACCAG CAAG T GCT GC T T T ACT AAC T GAC GT G T T 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
T GATCACAAACGTATAGAATGAGCCAAAAGGC TACAT CAT G TTGAGGG T G CTGGAGATT T TACAT 
T T T GACAAT TGTT TTCAAAATAAATGT TCGGAATACAGTGCAGCCCT TACGACAGGCT T ACGTAG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
TTTTGTCATGACAATGTAGGAATTGCTGAATTAAATGTTTAGAAGGATGAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^AG 



f! 
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FIGURE 34 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
ES FI VLESKPKKGVT YPSALT YS S SKSPAAQAGE TTKAYQRPP I PGTTAQPVTLMQLLAVTVAVA 
TPTTLPRPSP S AAS T T S I PRP QS VGHRS QEMDL W S TAT YT S S QNRPRAD PG I QRQD P SGAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RS GAPNWWMVDGWPTDKVEEASRLARESGINI FFI T IEGAAENEKQ YWEPNFANKAVCRTNG 
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKS KPNKRKLMI L I TDGRS YDDVRI PAMAAHLKGVI TYAI GVAWAAQEELEVI ATHPARDHS FF 
VDEFDNLHQYVPRIIQNICTEFNSQPRN 

Important features : 
Signal peptide: 

amino acids 1-2.6 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE as 



CC GAGC AC AG GAGAT T GC CT GCGT TT AG GAGGTGGC T GCGTT GT GGGAAAAGC TAT CAAGGAAGAAATTGC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CTCAGCCTTCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTTA 
CAGACAAGACTTTCACTT CACACTTC GAGAGCATTC AAACTGCTCTCATCAAAATCCATTTCTGGTCATT C 
TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
TGGT GGGG AT AT GAGGT T CTT AC ATT TT TCTT AT T AGGCC AAGAGGCT GAAAAGGAAG ACAAAATGTTGGC 
ATTGTCCTTAGAGGATGAACACCTTCTTTATGGTGACATAATCCGACAAGATTTTTTAGACACATATAATA 



f*\ ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG 



AAGACAGACACT GATGT T TT CATC AATACTGGCAAT T T AGTG AAGTAT CT TT T AAACCTAAACC AC TCAGA 
GAAGTT TTTC AC AGGT T ATCCTCT AATT GATAAT TATT CC T AT AGAGGAT TT TACC AAAAAACCC ATATT T 
CT TACC AGGAGT AT CCTT TC AAGGTGTT CCCT CCAT ACT GCAGT GGGTTGGGTT AT ATAATGT CCAGAGAT 
T T GGT G CCAAGG AT C TAT GAAATGAT GGGTCACGT AAAAC CC ATC AAGTT TG AAGATGTTT AT GTC GGGAT 
CTGT TT GAATTTATTAAAAGTGAACATT C ATATT CCAGAAGACACAAATCTT TTCTTT CTAT AT AGAAT CC 
ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT 
T GGC AGGTC ATG CT AAG GAAC AC CAC AT GCCATT AT TAACTTCACATT CT ACAAAAAG CCT AGAAGGACAG 



■M 

HI 
W 

If! 

3 

I M G AT ACCTTGT GG AAAGT GTT AAAT AAAGT AGGT ACT GT GGAAAATTCATGGGGAGGTC AGT GTGCT GGCTT 

Ml 

Ft ACACTGAACTGAAACT CAT GAAAAAC CC AGAC TG G AGACT GG AGGGTT AC ACTTGT GAT T T AT T AGTC AGG 

M 

||1 CC CT TC AAAG AT GAT AT G TGGAG GAATT AAAT AT AAAGGAATTGGAGGTT TT T GCT AAAGAAAT TAAT AGG 

Q AC CAAACAAT TT GGAC AT GT CATT CT GTAGAC TAGAAT TT CT TAAAAGGGT GT T ACTGAGTT AT AAGCTC A 

CT AGGC T GTAAAAACAAAACAATGTAGAGTTT TATT TATT GAAC AATGTAGT CACTT G AAGGT T TT GTGT A 
TATCTTATGTGGATTACCAATTTAAAAATATATGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATA 
CTGAACAAAATTTTACCTGTTTTTGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATT 
AT TATT T AAAAT TACT T C AACTTT GT GT T TTT AAAT GT TT TG ACGATT TC AAT AC AAG AT AAAAAGGAT AG 
TGAAT C ATTCTTTAC AT GCAAACATT TT CC AG TT ACTT AACT GATCAGTT TAT T ATTG ATACAT CACTC C A 
TT AAT GT AAAGT CAT AGGTC ATTATT GC ATAT CAGT AAT C TCTTGG ACTT T GTT AAAT AT TT TACT GTGGT 
AATATAGAGAAG AATT AAAG CAAGAAAAT CTG AAAA 
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FIGURE 36 



MASALWTVLPSRMSLRSLKWS LLLLSLLSFFVMWYLSLPHYISrVIERVNl#4YFyEYEPIYRQDFHF 
TLREHSNCSHQNPFLVILVTSHPSDVKARQAIR 

LS LEDEHLL YGD I IRQDFLDT YNNLT LKT IMAFRWVTE FCPNAK YVMKT D T DVFI NTGNL VK YL L 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVWGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 



Important features: 
Typ© II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

9\ , amino acids 72-76, 154-158, 198-202, 212-216, 326-330 



Glycosaminoglycan attachment site. 

fe|f amino acids 239-243 



m 
a 

HI 
1-1 



Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCG6CAAG GATGG AGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTCCT TGCCAAGAGAGT ACAC AGTCAT TAATGAAGCCTGCCC T GG AGCAG AG T G GAAT AT CAT GTGT CGGGAGT GCT GT G 
AATATGATCAGATTGAGTGCGTCTGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCCCXTGGTGCAGGAATGAGGAGAA 
TGAGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAA^CTGCA^GAGCTGCCGAAATGGCTCATGGGGGG&T 
ACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTGCCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTG 
GCCAGGTTCTGCGAGCCCCAAAGGGTCAGATTTTGTTGGAAAGCTATCCCCTAAATGCTCACTGTGAATGGACCATTCATGC 
TAAACCTGGGTTTGTCATCCAACTAAGATTTGTCATGTTGAGTCTGGAGTTTGACTACATGTGCCAGTATGACTATGTTGAG 
GTTCGTGATGGAGACAACCGCGATGGCCAGATCATCAAGCGTGTCTGTGGCAACGAGCGGCCAGCTCCTATCCAGAGCATAG 
GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC 
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
TATAC TGGGCAGCGC TGTGAAAAT C TCCT TGAAGAAAGAAAC TG C TC AGAC C C TGGGGGCC CAGTCAATGGGTACCAGAAAA 
TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGT 
TCTTAGTGGCAATGAGAAAAGAACTTGCCAGCAGAATGGAGAGTGGTCAGGGAAACAGCCCATCTGCATAAAAGCCTGCCGA 
GAACCAAAGATTTCAGACCTGGTGAGAAGGAGAGTTCTTCCGATGCAGGTTCAGTCA^GGGAGACACCATTACACCAGCTAT 
ACTCAGCGGCCTTCAGCAAGCAGAAACTGCAGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGG 
ATACCAACATCTGCATACCCAGCTCCAGTATGAGTGCATCTCACCCTTCTACCGCCGCCTGGGCAGCAGCAGGAGGACATGT 
CTGAGGACTGGGAAGTGGAGTGGGCGGGCACCATCCTGCATCCCTATCTGCGGGAZ^TTGAGAACATCACTGCTCCAAAGA 
CCCAAGGGTTGCGCTGGCCGTGGCAGGCAGCCATCTACAGGAGGACCAGCGGGGTGCATGACGGCAGCCTACACAAGGGAGC 
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
%p GTCAC CATGAT C AAGAC AGC AGACCTGAAAGTTGTT TTGGGGAAATTCT AC C GGGATGAT GACCGGGATGAGAAGACCATC C 

_$ AGAGC CTACAGATTTC TGC T ATCAT TCTGGATCCCAACTATGACCCC AT CC TGCTT GATGC TGACATCGCCAT CC T GAAGC T 

Q CCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCG<:TGCCAGTCGGGATCTCAGCACTTCCTTCCA^ 
HI TCCC ACATCAC T GTGGC TGGC TGGAATGT CC TGGC AGACGT GAGG AGCCCTGGCT T CAAGAACGACACACT GC GCTCTGGGG 

f g TGGTCAGTGTGGTGGAC TCGC TGCTGTGTGAGGAGCAGC ATGAGGACCATGGCATCCCAGTGAGTGTCACTGATAACATGTT 

CTGTGCCAGCTGGGAACCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 
CGAGCATCTCC T GAGC CAC GC TGGCATC TGATGGGACTGGT CAGC TGGAGC TAT GAT AAAACATGCAGCCACAGGCTCTCC A 
C TGCCTT CACC AAGGTGCT GCC T T T TAAAGACTGGATTGAAAGAAATAT GAAAT GAACCAT GC TC ATGC ACTCCTTGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAAC'ETGGCT 
GTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGCTGGTAGGCTGATGCCGCGTCCA 
C TAC TAGGACAGCCAAT TGGAAGATGCCAGGGC TTGCAAGAAGTAAGT T TC TT C AAA3AAGACCATATACAAAAC C TCTCCA 
CTCCACTGACCTGGTGGTCTTCCCCAACTTTCAGTTATACGAATGCCATCAGCTTGACCAGGGAAGATCTGGGCTTCATGAG 
GCCCC TT TT GAGGC T C T CAAGT T C TAGAGAGCTGCC TGTGGGACAGC C C AGGGCAGCAGAGCTGGGATGTGGTGCATGCC T T 
TGTGTACAT GGCCACAG TACAGTC TGGTC CT TTTCCTTCCCCATC TC T T GT AC AC AT T T TAATAAAATAAGGGTTGGCTTC T 
GAACTACAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



O 

111 

m 
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FIGURE 38 



MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMKCGQ 
VLRAPKGQ I LLES YPLNAHCEWT IHAKPGFVI QLRFVMLSLE FDYMC QYDYVEVRDGDNRDGQ 1 1 
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPICIKACR5PKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVWAAHCVTDLGKVTMIKTADL 
KWLGKFYRDDDRDEKTIQSLQrSAIILHPNYDPILLDADIAILKLLDKARISTRVQPICLAASR 
DLSTSFQESHITVAGWNVLADVRSPGFPCNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVS FPGRA5PEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 
g, EGF-like domain cysteine pattern signature. 

|!| amino acids 260-272 

.Ml N-glycosylation sites. 

if amino acids 96-100, 279-283, 316-320, 451-455, 614-618 

& 

1^1 N-myristoylation sites. 

ffj amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 
Amidation site, 
amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 



m 

•js 
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FIGURE 3Q 



GGTT CCTACAT C CTCT CAT CT GAGAAT CAGAGAGCATAAT CTT CTTACG GGCCC GTGATTTATT AACGT GGCTT AAT C 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGC 
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACA.TTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGTGCCT CCCACTACGGCCT GACCAAAGATAGGAAGAGGCGCT CACAAGATGGCTGTCCAGACGGCT GT GCGAGCC 
TCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACCATCTCCTTAATGACAGACGAGCCTGGCCTAGACA 
ACCCTGCCTACGTGTCCTCGGCAGAGGACGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAACCGAACTA 
GGGCACGGCCCTTTGAGAGATCCACTATTAGAAGCAGATCATTTAAAAAAATAAATCGAGCTTTGAGTGTTCTTCGAA 
GGAC^AGAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGGGC^GGGAAAATTCTGAAAACACCACTGCCCGTGAAG 
T CT TT CCAAGGT T GT ACCACCTGATT CCAGATGGTGAAATTAC CAGCAT CAAGAT CAAT CGAGT AGAT CCCAGTGAAA 
GCCTCTCTATTAGGCTGGTGGGAGGTAGCGAAACCCCACTGGTCCATATCATTATCCAACACATTTATCGTGATGGGG 
%£r TGATCGCCAGAGACGGCCGGCTACTGCCAGGAGACATCATTCTAAAGGTCAACGGGATGGACATCAGCAATGTCCCTC 
fj. ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
| jj GCAGGAACAAT GGACAGG C CCCGGAT GCCTACAGAC CC C GAGAT GACAG CT TT CAT GT GATT CT CAACAAAAGTAGCC 

f$f CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
fjj GTGTGGCATAT CGACATGGTCAGCTTGAGGAGAATGACCGTGT GTTAGCCATCAATGGACATGATCTTCGATAT GGCA 

|fl GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
il| GGAGC CCT GAGAT CT TTCAGGAAG CC GGCT G GAACAGCAAT GGCAG CT G GT CC CCAGGGCCAG GGGAGAGGAGCAACA 

k|j CTCCCAAGCCCCTCCATCCTACAATTACTTGTOTGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCG 
r. GCATGACC GTC GCAGG GG GAG CAT CACATAGAGAAT GGGAT TT GCCTATCTAT GT CAT CAGT GTT GAGC CC GGAGGAG 

f3 . T CATAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTTGTTGAATGTGGATGGGGT CGAACT GACAGAGGTCAGCC 

IJI GGAGT GAGGCAGTGGCATT ATT GAAAAGAACAT CAT CCT CGATAGTACT CAAAGCTTT G GAAGT CAAAGAG TAT GAGC 

Q CCCAGGAAGACTGCAGCAGCCCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCT 
ffff GGGTCATGTGGCTGGAATTACCACGGTGCTTGTATAACTGTAAAGATATTGTATTACGAAGAAACACAGCTGGAAGTC 
Q TGGGCTTCT GCATTGT AGGAG GTTAT GAAGAATACAATG GAAACAAACCTT T TT T CAT CAAATCCATT GTT GAAGGAA 

f|| CACCAGCATACAAT GATGGAAGAATTAGATGTGGTGATATT CTT CTT GCT GT CAAT GGTAGAAGT ACAT CAGGAAT GA 

TACATGCTTGCTTGGCAAGACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTT 
TTTT ATAGA ATCAATGATGGGTCAGAGGAAAACAGAAAAATCACAAATAGGCTAAGAAGTTGAAACACTATATTTATC 
T T GTCAGT T T TTATATTTAAAGAAAGAAT ACATT GTAAAAATGT CAGGAAAAGTAT GATCAT CTAATGAAAGCCAGTT 
ACAC CT CAGAAAATAT GAT T C CAAAAAAATTAAAACTACTAGT TTT TT TT CAGT GTGGAGGATTTCT CATTACT CTAC 
AACAT T GT TTATATTT TTT CTATT CAATAAAAAGCC CTAAAACAACTAAAAT GATTGAT TT GTATAC C C CACTGAATT 
CAAGCT GATTTAAATTTAAAATT T GGTAT ATGCT GAAGTCT GC CAAGGGTACAT TAT GGCCATTTTTAATTTACAGCT 
AAAATATTTTTTAAAATGCATTGCTGAGAAACGTTGCTTTCATCAAACAAGAATAAATATTTTTCAGAAGTTAAA 
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FIGURE do 



MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRSFKKINR 
ALSVLRRTKS GSAVANHACQGRENSENTTAPE VFPRL YHL I PDGE I TS I KINRVDPSE S LS IRLV 
GGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRL.LRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLI QASERRVHLVVSRQVRQRSPDI FQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKVVNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLWDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
S DWS PSWVMWLE L PRC LYNCKD I VLRRNTAGS LG FC I VGGYEE YNGNKP F FI KS I VEG T P AYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

13 Important features: 

fel* Signal peptide: 

•!!? amino acids 1-15 

pi 
>. r. 

v[| N-glycosylation sites. 

* amino acids 108-112, 157-161, 289-293, 384-388 

m 

%l Tyrosine kinase phosphorylation sites, 

amino acids 433-441, 4 92-500 

w 

f|| N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE dl 



ACCAGGCATTGTATCT TCAGT TGT CAT C AAG T T CGC AAT CAGAT T GGAAAAGCT CAACTT GAAGCTT T 
CTT GCCT GCAGTGAAGCAGAGAGAT AGATAT TATTCACGTAATAAAAAAC ATG GGCT TCAACCT GACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGACCCTCATTTTGGGGAAGGGAAAAACTCTGACTAATGAAGCATCCACGAAGAAGGTAGAACTT 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
T T T GGAAGAGGTACAGGCAGAAAAT CC C AAAGT G T CC AGAG GCC GGT AT CGCCCT CAGGAATGT AAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
CAT CTGC ATCCCTTC CTGCAGAGGCAGC AGCT GGATT AT G G CATC TACGT CAT CCACCAGGCT GAAGG 
TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGG 
ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 
GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
C CT GAAGTGGGT AAAT ATAC AATGGTCTT CCAC ACTAGAGACAAAGGCAAT GAGGTGAAC GCAGAACG 
GAT GAAGCTCTTACACCAAGT GTC ACGAGTCTGGAGAACAGATG GGT T GAGTAGT TGTTC T TAT AAAT 
TAGTATCTGTGGAACACAATCCTTTATATATCAACATCACAGTGGATTTCTGGTTTGGTGCATGACCC 
TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
TAGCAGAGCTCCTGGTGATGTAGAGTATAAAACAGTTGTAACAAGACAGCTTTCTTAGTCATTTTGAT 
CAT GAGGGTTAAATATTGT AATAT GGAT ACT TGAAGGACTT TATAT AAAAGGATGACTCAAAGGATAA 
AAT GAACGCTATTTGAGGACT CTGGTT GAAGGAGATT TATTTAAATT TGAAGTAATATATTATGGGAT 
AAAAGGC CACAGGAAATAAGACTGCTGAATGTC T GAGAGAACCAGAGT TGTTCTCGT CCAAGGTAGAA 
AGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGGCG TC CACAAAAGAGGGGAGAAAAG GC GACGAAT CAGGACAC AGT GAACTT GGG AAT GAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
T T T T AAAGAGT T T T T GT AAAATGAT TT T GTACAAGT AGGATAT GAAT T AGC AG T T TACAAGTT T ACAT 
ATT AACTAATAAT AAAT ATGT CTAT CAAATACC T CTGTAGTAAAATGT GAAAAAGCAAAA 
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FIGURE 42 

MG FNLTFH LS YKFRLLLLLTLCLT WGWATSNYFVGAI QE I PKAKE FMAN FHKTL I LGKGKT LTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RKREKHmYLLEHLKPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINI TVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isoraerase proteins. 

amino acids 191-202 
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FIGURE 43 



m 

m- 

m 

US 
,21 



Q 



& 
111 



GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGATCTGGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

T CCAAAAA&AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE dd 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLMLQPQDRAGARASWMPMFQRRRRRDTHFPI 
CIFCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
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FIGURE 45 



G TGG C T T CAT T T CAG T GGC T GAC T T CC AGAGAGC AATATGG CTGGTTCCC C AAC AT GC C T CAC C C 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTTCAACACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCATAGTGACCCAAA 
ATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAG 
AAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATGGAACATGGGGAAGAGGATGTGATT 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
y % CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT 
f ! TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
IJI GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 



m 

m 



GT T TC TGAAGAGAGAGAGACAAGAAGAG T ACAT TGAAGAGAAGAAGAG AG T GGACATT T GTCGGG 
AAACT CCTAACATAT GCCCCCAT T C TGGAGAGAACACAGAG T ACGACACAATCCCTCACACT AAT 
AGAACAATCCTAAAGGAAGATCCAGCAAATACGGTT TACT CCACTGTGGAAATACCGAAAAAGAT 
ijj GGAAAATCCCCACT CACTGC TCACGAT G C CAGACACAC CAAGGC TAT TTG CCTAT GAGAA.TGT T A 

n TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 

P 



0 
CO 
S3 
fll 
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FIGURE 46 



MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
E GGT 1 I VTQNRNRERVDFPDGG YS LKLSKLKKNDS G I YYVG I YS S S LQQPSTQE YVLHVYEHLSK 
PKVMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFSSPILARKLCEGAADDPDSS1WLLCLLLVPLLLSLWLGLFLWFLKRERQEEYIE 
EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMEKPHSLLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

,j, amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

by 

m 
f i a 

i.f| Leucine zipper pattern. 

%y amino acids 229-251 

T, 

r\ 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-130, 204-208, 
291-295 

i 
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FIGURE 47 



PI 

ru 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGG T CT GATGGCCATTCCAGCAACAACAAT GT CCT TGACAGCAAGAAAAAGAGCGT G 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AAC AG T AAT GC C AAT T G T GAAT T T T CAT T G AAAAAC AT CAG T G AC AT T CAT C CAGAAT C C T T CAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACC ATGGCGAG T GGCT GGAGAGCAT C TAGT T TCCACTT CGATTCT GAAGAAAACAAAC AT AGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
TG TAGT TTAATGGGAATAAAATGTAAGTATCA.GTAGTTTGAAAAAAAAAAA. 
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FIGURE d8 



MTCCEGWT SCNG FS LLVLL LLGWLNAI PL I VSLVEEDQFS QNP I S C FE WWFPG 1 1 GAGLMAI PA 
TTMSLTARKRAC CNNRTGMFLS S FFSVI TVI GALYCML I S I QALLKGPLMCNS P SNSNANCE FS L 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
| s 4 N-glycosylation sites. 

CI amino acids 80-84, 132-136, 148-152, 163-167 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 



W. 

m 

■ n ■ 

N-myristoylation sites. 

* w amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

m 

m Prokaryotic membrane lipoprotein lipid attachment site. 

fit amino acids 207-218 

s 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE aq 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CAT T T GC CAATAAAGACGAT CC C T T C TAC TAT GAC T GGAAAAACC T GCAGCT GAGCGGACT GAT C 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
CTACT TGCTGAGCACAGGAC TGGC C T C CAGGGATGGCCT GAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
(wb TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 

O 

m 
w 
m 



O 
if! 

a 

HI 
m 
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FIGURE so 



MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
S QKQHS P VPEKAI PL I TPGSATTC 

Important features : 
Signal peptide : 

amino acids 1-16 

Transmembrane domain: 

amino acids 3 6-59 

N-rnyristoylation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE 51 



GTGGACTCTGAGAAGCCCAGGCAGTTGAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGAGGG 

AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 

AGGAGAC C CT GGT GGGAGGAAGACAC TCTGGAGAGAGAGGGGGCTGGG C AG AGATGAAGT T CCAG 

GGGCCCCTGGCCTGCCTCCTGCTGGCCCTGTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 

CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 

GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 

GCCCTTGGCCAAGGGACCAGAGAAGCA.GTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 

AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 

TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 

H" GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCGATGGCATCTTTGGCTCTCAAGGTGG 

13 CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 

JJJJ GAAACT CAGCAGGCAGC T TTGGAAT GAATC C T CAGGGAJ3CTCCCTGGG GTC AAGGAGGCAAT GGA 

'it's . 

GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 

AG C CAGCAAC CAGAAT GAAGGG T G CAC GAAT C CC CC AC CAT CTGG CT CAG GT GGAGGC T C CAGC A 

ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 

AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 

TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 

CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 

CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 

C T T CAGAG GACAGGGAGT TTCCAGC AACATGAGGGAAAT AAGCAAAGAGGGCAAT CGCCT CCT T G 



m 
m 
m 

n 

5 



GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 



GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAAT T T TAAATC CAAGC TGGG T T T CAT CAAC TGGG AT GCC AT AAACAAGGACCAGAGAAGCT C T C 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 
AAC ACCACCC TC TC AT CAC T AAT CT CAGCC CTTGCCCTT GAAATAAACC T T AGC T GCC CC ACAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 52 

MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTKIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHE I GRQAEDVIRHGADAVRGSWQGVP 
GHSGAl)ffiTSGGHGIFGSQGGLGGQGQGNPGGLGTFWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
S SGSS SGS S SGGS SGGS SGGS SGNSGGSRGDSGSES SWGS STGS SSGNHGGSGGGNGHKPGCEKPGNE 
ARGS GE S GI QGFRGQGVS SNMRE I S KEGNRLLGGSGDM YRGQGS SWGSGGGDAVGGVNTVN SET S PGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 



Signal peptide: 

t t amino acids 1-21 

htm 

m 

Q N-glycosylation site. 

ff] amino acids 265-269 

ill 

feM Glycosaminoglycan attachment site. 

'^f amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 

fell 

n . 

pi Casein kinase II phosphorylation site. 

Ifl amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



m 

■ hnt 

111 N-myristoylation site. 

W amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

f|| 90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251 r 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-2S0, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 5 3 



GGAGAAGAGGTT G T GT GGGACAAGC TGC TCCCGACAGAAGGATGTCGC T GCTGAGCCTGCCCT GG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 
CCT CAGAGGGCAGC AGTC GT C T GGAC ATGT T TGAGC AC AT CAGCCT CAT G ACCT TGGACAGT CT A 
C AGAAATGCATCT TCAGCT T T GACAGC CAT TGT C AGGAGAGGCCCAGTGAAT AT AT T GCCACCAT 
CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 



mi 

m 

%V* ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 



GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 



ill 

Hi 

fell 

ij| CAT T GTCAGATGAGGATATAAGAG CAGAGGC TGACACCT T CATGT TT GGAGGCCATGACACCAC G 

*' GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 

lit ACAGGAGGT GCAAGAGCTTCT GAAGGACCGCGATCC TAAAGAGAT TGAATGGGACGACCT GGCC C 

I n 

Z* AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGrTACATCCCCCAGCTCCCTTCATC 
SI TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 

m 

f"| CCTCAT C GATAT TATAGGGGTCCATCACAACCCAAC TGT GTGGCCGGAT CC TGAGGT CTACGACC 

f|J CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE *a 

MS L LSLPWLGLRPVAMS PWLLLLLWGS WL LAR I LAWTYAFYNNCRRLQC FPQP PKRNWFWGHLG 
L I TPTEEGLKDSTQMSAT YSQGFTVWLGP 1 1 PFI VLCHPDT IRS I TNASAAI APKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRBMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFI SRCCTQD IVLPDGRVI PKGI TCL I D 1 1 GVHHNP TVWPDPEVYDPFRFDPENSKGRSP 
LAFIPFSAGPRNCIGQAFAMAEMKWLALMLLHFRFLPDHTEPRRKLELIMRAEGGLWLRVEPLN 

C VGLQ 

1"! 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II) , 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids .461-471 

hnf. 
tfl 

ISJ N-glycosylation sites. 

ill amino acids 112-116, 168-172 

m 
mi 



m 

ill 
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FIGURE ss 



m 

r\ 

;#! 

Co 
Ill 



AT C GCAT CAATT GGGAGTACCAT CT TCCT CATGGGACCAGTGAAACAGCTGAAGCGAAT GTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTAXCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAA TTCATGGCCAGTTTTATGAAGCTTTGGAAGGCACTATGGACAGAAGCTGGTGGACAGTTTT 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GT GATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACAT TC TCGAACCTGAATGT CAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
TCTGGATGTTGTCCCACTGAATTCCCATGAATACAAACCTATTCAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 56 



MG PVKQLKRMFE PTRL I AT IMVLLC FALTLCS AFWWHNKGLAL I FC I LQS LALTWYS L S FI P FAR 
DAVKKCFAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

| ;> ,| ; Type II fibronectin collagen -binding domain protein. 

D amino acids 30-72 

i'l 

m 
-4 - 

m 

40 



"r! 



CIS 

ru 
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FIGURE 57 



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCgCCAACATGCTCTGTCTGTGCCTG 
TAC GT G CC GGTCAT C GGGGAAGCCCAGAC CGAGTT C CAGTACT TT GAGTC GAAGGGGCT C CCT GC CGAGCT GAAGT C C 
ATTTTCAAGCTCAGTGTCTTCATCCCCTCCCAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCT 
GGAGATAAGGAC CTT GAT GGGCAGCT AGACT TT GAAGAAT TTGT CCAT TAT CTCCAAGATCAT GAGAAGAAGCTGAGG 
CTGGTGTTTAAGATTTTGGACT^WUVGAATGATGGACGCATTGACGCGC^GGAGATCATGCAGTCCCTGCGGGACTTG 
GGAGT CAAGATAT CT GAACAG CAGGCAGAAAAAATT CTCAAGAGCAT G GATAAAAACGGCAC GAT GAC CAT CGACT GG 
AACGAGT GGAGAG ACTACCACCT CCT CCACCCCGTGGAAAACAT CCCCGAGATCATCCTCTACT GGAAGCATTCCACG 
AT CTTT GAT GTGGGT GAGAATCTAACGGT C CC GGATGAGT T CACAGTGGAGGAGAGGCAGAC G GGGAT GT GGT GGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGTCCAT GCCTCCCGCAGCAACAACAT GGGCATCGTTGGTGGCTT CACT CAGATGATTCGAGAAGGAGGGGCCAGG 
TCACTCTGGCGGGGCAATGG(^TCAACGTCCTCAAAATTGCCCCCGAATCAGCC^TCAAATTCATGGCCTATGAGC^G 
AT CAAGC GC CTT GTT GGTAGT GACCAGGAGACT CT GAGGATTCACGAGAGGCTT GTGGCAGGG TC CTT GG CAG GGG C C 
ATCGCCCAGAGCAGCATCTACCCAATGGAGGTCCTGAAGACCCGGATGGCGCTGCGGAAGACAGGCCAGTACTCAGGA 
ATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGGTGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTG 
GGCATCATCCCCTATGCCGGCATCGACCTTGCAGTCTACGAGACGCTC^ 

AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
GCTGTGAGCATCAGCTACGTGGTCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTGACGGGGGGAGGGC 
m CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
|b AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
^ 2 GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 
j.f| CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
*# CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
fe - ATAAT C CAT GAT GAAAGGT GAGGTCACGT GGCCT C CCAGGC CT GACTT C CCAAC CTACAG CAT T GACGCCAACTT GGC 

'»m T GTGAAG GAAGAG GAAAG GAT CT GGCCTT GTGGT CACTGGCAT CT GAGC CCT GCT GATG GCT GGGGCT CT CGG GCATG 

%*t CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
|,fl GGCCTGGACCCTGTCAGGATGGGCCCCACCTC^GAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA 
m CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
S?l ACTGTTGGGAAAAGGGTTTTGTCCAGAAGGACAAGCCGGACAAATGAGCGACTTCTGTGCTTCCAGAGGAAGACGAGG 
ff| GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
pi GGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTTATTTTGTATTTATTTGAACAGAGTTATGTCCT 
■ W AACTATTTTTATAGATTTGTTTAATTAATAGCTTGTCATTTTCAAGTTCATTTTTTATTCATATTTAT GTT CATGGTT 

Fjj GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTTGG G GAAGGG CTT G CC C CCAGCCT TAGGATTT CAG GGT TT GACTGG GGGCGT GGAGAGAGAGGGAGGAACCTC AAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAGGAAAT CTT CAATAGGAT GC AAAGAT CAATG CAAAAATT GTTAT ATAT GAACATATAACT GGAGT C GT CAAAAAG 
CA^TTAAGAAAGAATTG GAC GTTAGAAGTT GTCATT TAAAGCAGC CT T CT AAT AAAGTT GT T T CAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 58 



MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQETMQSIiRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNEWRD YHLLHP VEK I PE 1 1 LYWKHS T I FDVGENLTVPDE FTVEERQTGMWWRHL VAGGG 
AGAVSRTCTAPLDRLKVLMQVHASRSNmGIVGGFTQMlREGGARSLWRGNGINVLKIAPESAIK 
F^YE Q I KRL VGS DQE TLRI HERLVAGS LAGAI AQS S I YPMEVLKT RMALRKTGQ YS GMLDCARR 
ILAREGVAAFYKGYVPNMLGI I PYAG I DLAVYETLKNAWLQHYAVNSADPGVFVLLACGTMSS TC 
GQLAS YPLALVRTRMQAQAS I EGAPE VTMS S L FKHI LRTEGAFGLYRGLAPNFMKVI PAVS I S YV 
VYENLKITLGVQSR 



|J| Important features: 

i£j Signal peptide: 

m 

m 
m 

» f\ Putative transmembrane domains : 



amino acids 1-16 



amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 



I'll N-glycosylation sites. 

amino acids 129-133, 169-173 



Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE so 



GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAAACTTTCTGATAT 
CG T G AT ACAATGGCTGAAGGAAGG T G T TT T AGGC T T GGT C CAT GAGT T C AAAGAAGGCAAAGAT G 
AGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTT 
GG C AAT GCCT CT T T GCGGC T GAAAAAC G TG CAAC T C AC AGAT GC TGGCACC TACAAATGT TATAT 
CATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAGCCTTCAGCATGCCGG 
p AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
£j CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 

m 



m 



CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
T CAACAACACAT ACTCCTG TATGATTGAAAATGACATT GCCAAAGCAACAGG GGATATCAAAGT G 



AC AGAA T C G GAG AT C AAAAGG C GGAG T CAC C TAC AG C T G C T AAAC T C AAAGGC TTCTCTGTGTGT 
ill CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAA ATAA T 
n GTGCCTTGGCCACAAAAAAGCATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCAC 
CACCAGATATGACCTAGTTT TATATTTCTGGGAGGAAATGAATTCATAT C TAGAAGTCTGGAGTG 
AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 



K CTATCTTCAAAGACATATTAGAAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGA 

m 

fn GT GATAAG TAAAAT GC ACG TGGAGACAAGTGCAT CCC C AGATCT CAGG GAC CTCCCCC TGC CTGT 

fil CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGC TG TAGTATGTACCCTAAGACGCTGC TAATTGACTGCCACT TCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGCATAAACAGAGCAGT 
CGGGGACACCGATTTTATAAATAAACTGAGCACCTTCTTTTTAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 



MAS LGQ I LFWS I 1 S 1 1 1 ILAGAI AL 1 1 G FG I SGRHS I TVTTVASAGNI GEDG ILSCT FEPDI KLS 
D I VI QWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVI VGNAS LRLKNVQLTDAGTYKC 
YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQFTVVWASQVDQGANFSEVS 
NTSFELNSENVTMKWSVLYWTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLOLLNSKASL 
CVSS FFAI SWALLPLSPYLMLK 



Important features: 
Signal peptide: 

■N 8 amino acids 1-28 

. r! 

p| Transmembrane domain: 

jbj amino acids 2 58-281 

m 



N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 



m 

|s| N-myristoylation sites. 

h l i amino acids 52-53, 12 6-132, 188-194 

n 1, 
III 
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FIGURE 6l 



T G ACG T C AG AAT CACC ATGGC CAG C TAT C C T T AC C GG C AG G G C T G C C CAG GAG C T G CAG G AC AAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGT GGAC T CAGA.TC ACAGTGGC T ATATC TCCATGAAGG AGC TAAAGC AGGCCCT GG TCAAC T GCA 
ATT GGTCT T CATT CAAT GAT GAGAC C T GCCT CATGATGAT AAACATGTT TGACAAGACCAAGTCA 
*» GGCCGC ATCGAT GT CT ACGGCTT C TCAGCCCTGTGGAAAT TCATCCAGCAGTGGAAGAACCT CT T 

Us 

Q CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
|fl • AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 



III 



GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 



w 
m 

, f% CCATGACAGCTTCTCGGATGCTATGACCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 
" r? . TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
W ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 



AATAGTGAGGACCGGGGCTGAGGCCACACAGATAGGGGCCTGATGGAGGAGAGGATAGAAGTTGA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 



f|| GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCATGAGAGAGATCTGTCTATGGGACCAGTGGCTTGGATTCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
ATTTGGGGCCAAAAGTCCAGTGAAATTGTAAGCTTCAATAAAAGGATGAAACTCTGA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISHKELKQALVNCNWSSFNDETCLMMINMFDKXKSGRIDVYGFSALWKFIQQWKNLFQQYDR 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSAMPMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNI RLS FEDFVTMTASRML 



Important features of the protein: 
Signal peptide: 



CI amino acids 1-19 

P 

ff| N-glycosylation site, 

\M amino acids 147-150 



Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 



$ 
n 

III N-rayristoylation sites. 

fc 1 amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 

fi! 

%Z 66, 70-75, 7S-83, 83-88, 87-92, 110-115 

ml 

III 
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FIGURE 6a 



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 



m ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 



CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
Q GGGCTAGAGCACGATAGAGGGAAAC CCAACAT TGG GAG T T AGAGTCCT GCT CCCGCCC CT TGCT G 

IJI TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
151 TGCTCATCTCCAATAAAATAAAAGCACTTAT(y^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAftAAAA^ 
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FIGURE 64 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDIARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 



Important features of the protein: 
Signal peptide: 

amino acids 1-2 6 

Q 

£11 Casein kinase II phosphorylation sites. 

W amino acids 56-59, 155-158 

■ft! 



as 



N-myristoylation sites, 

„ amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 

|3 259-264, 269-274, 270-275, 280-285, 281-286, 305-310 



in 

0 

m 

m 
fli 
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FIGURE 6 s 



AZVGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
C T T T GCAAC TGAAGC T GAAGGAGT C T T T T C T GAC AAAT T CCT CCTAT GAGT CCAGCT TC CTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAG ATC T CAACACCAT GT T GT CT GCAACACATGACAGC CAT T GAAGC CTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
C T GAG T GG CAAT AAAT AAAAT T C GGT ATGC T G 



I? sis 

m 
m 

m 
hi 

m 

'w • 

m 



m 
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FIGURE 66 



MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCNT 



Important features: 
Signal peptide: 

amino acids 1-19 



N-glycosylation site. 

fl amino acids 37-41 

tip N-myristoylation sites. 

|JJ amino acids 15-21, 19-25, 60-66 



% ■ 

U 

m 
o 
sit 
o 

u 
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FIGURE 67 



III 
lift 

r! 

1 « 

w 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGT T C TC CCAT GCAGCCGAGGCCCGGGCCCCTC AGGACT CCAAGGAGACGG TGCAGGGCT 
GCCTGCCCATC TAG GTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCC T C TC TGGGCAG T CAG AT CC ACCCAGT GC T TAATAGCAGGGAAGAAGGTACT T CAA 
A.GAC T C TGCCCCTGAGG TCAAGAGAGGATGGGGC TAT T CAC T T TTAT AT AT TTATATAAAAT TAG 
TAGTGAGATGTAZUUlAAAAAAAAA£i^.AA 
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FIGURE 68 



MANPGLGL LL ALGLP FLLARWGRAWGQI QTT S ANENS T VLP S S T S S SS DGNLRPEAI TAI I WFS 
LLAALLIAVGLALLWKLREKRQTEGTYRPSSESQFSHAAEARAPQDSKETVQGCLPI 



Important features: 
Signal peptide: 

amino acids 1-19 



i , Transmembrane domain: 

jg amino acids 56-80 



N-glycosylation site. 

amino acids 36-40 



\Q cAMP- and cGMP-dependent protein kinase phosphorylation site. 



ff amino acids 3 6-90 

D 

5 

a 



Tyrosine kinase phosphorylation site. 

amino acids 86-94 



III N-rnyristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6q 



GC CAGGAATAAC TAGAGAGGAACA ATG GGGTTAT TCAGAG GTT TT GTTTT CCTCTTA.GT TCT GT GCCT GCT GCACCAG 
TCAAATACTTCCTTCATTAAGCTGAATAATAATGGCTTTGAAGATATTGTCATTGTTATAGATCCTAGTGTGCCAGAA 
GAT GAAAAAATAATT GAACAAATAGAGGAT ATGGT GACTACAGCTTCTAC GTAC CTGT TTGAAGCCACAGAAAAAAGA 
T T TTTT TT CAAAAAT GTATCTATATT AAT T CCT GAGAATT GGAAG GAAAATCCT CAGTACAAAAGGCCAAAACAT GAA 
AACCATAAACAT GCT GATGTTATAGTTGCACCACCTACACT CCCAGGTAGAGATGAAC CATACACCAAGCAGTTCACA 
GAATGTGGAGAGAAAGGCGAATACATTCACTTCACCCCTGACCTTCTACTTGGAAAAAAACAAAATGAATATGGACCA 
CCAGGCAAACTGTTTGTCCATGAGTGGGCTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTC 
TACCGTGCTAAGTCAAAAAAAATCGAAGCAACAAGGTGTTCCGCAGGTATCTCTGGTAGAAATAGAGTTTATAAGTGT 
CAAGGAGGCAGCTGTCTTAGTAGAGCATGCAGAATTGATTCTACAACAAAACTGTATGGAAAAGATTGTCAATTCTTT 
CCTGATAAAGTACAAACAGAAAAAGCATCCATAATGTTTATGCAAAGTATTGATTCTGTTGTTGAATTTTGTAACGAA 
AAAAC CCATAAT CAAGAAGC TC CAAGCCTACAAAACAT AAAGT GCAATTT T AGAAGT ACAT GGGAGGT GAT TAG CAAT 
TCTGAGGATTTTAAAAACACCATACCCATGGTGACACCACCTCCTCCACCTGTCTTCTCATTGCTGAAGATCAGTCAA 
AGAATTGTGTGCTTAGTTCTTGATAAGTCTGGAAGCATGGGGGGTAAGGACCGCCTAAATCGAATGAATCAAGCAGCA 
AAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGGTGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTA 
lj s AAT AAGCTAAT C CAAATAAAAAGCAGTGAT GAAAGAAACACACT CAT GGCAGGATTAC CTAC AT ATCCT CT GGGAGGA 

?ll ACTTCCATCTGCTCTGGAATTAAATATGCATTTCAGGTGATTGGAGAGCTACATTCCCAACTCGATGGATCCGAAGTA 
hJ CT GCT GCT GACT GAT GGGGAGGATAACACT GCAAGT TCTT GTATTGATGAAGT GAAACAAAGT GGGGC CATT GTT CAT 

Q TTTATTGCTTTGGGAAGAGCTGCTGATGAAGCA.GTAATAGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTATGTT 



TCAGATGAAGCTCAGAACAATGGCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAG 





V ? ' T C CCTT CAGCT C GAAAGT AAGGGATT AACACT GAAT AGTAATGCCT GGAT GAAC GACACT GT CATAATT GATAGTACA 



GTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTATTTCTCTCTGGGATCCCAGTGGAACA 
ATAATGGAAAATTTCACAGTGGATGCAACTTCCAAAATGGCCTATCTCAGTATTCCAGGAACTGCAAAGGTGGGCACT 
T GG GCATACAAT CTT CAAGCCAAAG C GAACC CAGAAACAT TAACTATT ACAGT AACT T CT C GAGCAGCAAATT CTT CT 
GT GCCTCCAATCACAGTGAATGCTAAAATGAATAAGGACGTAAACAGTTTCCCCAGCCCAAT GATTGTTTACGCAGAA 
ATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGCTTTCATTGAATCACAGAATGGACATACAGAAGTT 
*jW T T GGAACTTTT GGATAATGGT GCAG GCGCT GATTCT TTCAAGAATGAT GGAGTCTACT CCAGG T ATTT TACAGCAT AT 

:r. ACAGAAAAT GGCAGATATAGCTTAAAAGTT CGGGCT CAT GGAGGAGCAAACACT GCCAG GCT AAAATT ACGGCCTCCA 

isr s CTGAATAGAGCCGCGTACATACCAGGCTGGGTAGTGAACGGGGAAATTGAAGC^AACCCGCCAAGACCTGAAATTGAT 
W GAGGATACTCAGACCACCTTGGAGGATTTCAGCCGAACAGCATCCGGAGGTGCATTTGTGGTATCACAAGTCCCAAGC 
||f CTTCCCTTGCCTGACCAATACCCACCAAGTCAAATCACAGACCTTGATGCCACAGTTCATGAGGATAAGATTATTCTT 
Jf S: | ACATGGACAGCACCAGGAGATAATTTTGATGTTGGAAAAGTTCAACGTTATATCATAAGAATAAGTGCAAGTATTCTT 
fe*3 GATCTAAGAGACAGT TT TGAT GATGCTCT T CAAGT AAAT ACTACT GAT CT GTCAC GAAAGGAG GC CAACTCCAAGGAA 

ft! AGCTT T GCAT TTAAACCAGAAAATAT CTCAGAAGAAAATGCAACCCACATATTTATT GC CATTAAAAGTATAGATAAA 

IJJ AGCAAT TT GACATCAAAAGTAT CCAACATT GCACAAGT AACTTT GTTT AT CCCT CAAGCAAAT C CTGAT GACAT T GAT 

CCTACACCTACTCCTACTCCTACTCCTACTCCTGATAAAAGTCATAATTCTGGAGTTAATATTTCTACGCTGGTATTG 
||| T CT GT GATT G GGTCT GT TGTAATT GT TAACTTTATTTTAAGTAC CAC CATTTGAACCT T AAC GAAGAAAAAAAT CTT C 

AAGTAGACCTAGAAGAGAGTTTTAAAAAACAAAAC^TGTAAGTAAAGGATATTTCTGAATCTTAAAATTCATCCCAT 
GTGTGATCATAAACTCATAAAAATAATTTTAAGATGTCGGAAAAGGATACTTTGATTAAATAAAAACACTCATGGATA 
T GT AAAAACT GT CAAGATTAAAATTTAATAGTTT CATTTATTT GTTAT TTTATT T GTAAGAAATAGT GATGAACAAAG 
AT CCTTT TTCATACT GATACCT GGTT GTATATTATTTGAT GCAACAGTTT TCT GAAAT GATAT T TCAAATT G CAT C AA 
GAAATTAAAATCATCTATCTGAGTAGTCAAAATACAAGTAAAGGAGAGCAAATAAACAACATTTGGAAAAAAAA 
AAAAAAAAAAAA^W\AAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 70 



MGLFRGFVFLLVLCLLHQSNTSFI-KLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGS CLSRACRI DSTTKL YGKDCQFFPDKVQTEKAS IMFMQS I DS WEFCNEKTHNQEAP 
SLQNIKCNFRS TWEVI SNSEDFKNT I PMVTPPPPPVFSLLKI SQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGS WGMVH FDS TAT I VNKL IQIKS SDERNTLMAGLPTYPLGGTS I CSG I K 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTYGKDTFFL 
ITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSR 



?!% AANSSVPPITTOAKMNKDVNSFPSPMIVYAEIIiQGYVPVLGANVTAFIESQNGHTEVLELLDNGA 
if| GADS FKNDGVYSRYFT AYTENGRYS LKYRAHGGANT ARLKLRP PLNRAAY I PGWVVNGE I EANP P 

U| RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQXTDLBATVHEDKIILTWTAPGDN 
FDVGKVQRY I IRIS AS I LDLRDS FDDALQVNTTDLSPKEANSKES FAFKPENI SEENATHI FI AI 
KSIDKSNLTSKYSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 



iff Signal peptide: 

m 

amino acids 1-21 

m 
a 



Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 



N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CT C CT TAGGT GGAAAC CCT GGGAG TAGAGT ACT GACAGCAAAX3ACC GGGAAAG ACCATACGT CCCCGGGCAGGGGT GA 
CAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAATGATGTAGCCACCTCCTAACCTTCCCTTCTTGAACCCCGAGTTATGCCAGGATTTACTAGAGAGTGTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GTGCTCAGAAGTGACTTGCCTGAGGGTGGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAA 
GGCT GTGAT GGGAAT GAAGGT GAAAACTT GGAGAT TT CACTTCAGTCATT GCTTCT GC CT GCAAGATCAT C CTTTAAA 
AGTAGAGAAGCTGCTCTGTGTGGTGGTTAACTCCAAGAGGCAGAACTCGTTCTAGAAGGAAATGGATGCAAGCAGCTC 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTAC^TGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
CGCAACTACGTGAGCAGCCTGAAGCGGCAGATCGC^CAGCTCAAGGAGGAGCTGCAGGAGAGGAGTGAGCAGCTCAGG 
AATGGGCAGTACCAAGCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTC 
CTGGCCTTCCTGCACTCGCAGGTGGACAAGGCAGAGGTGAATGCTGGCGTCAAGCTGGCCACAGAGTATGCAGCAGTG 
?!& CCTTT CGATAGCTTTACTCTACAGAAGGTGTACCAGCTGGAGACTGGCCTTACCCGCCACCCCGAGGAGAAGCCT GTG 

AGGAAGGACAAGCG GGAT GAGTT G GT GGAAG C CAT T GAAT CAGC CT T GGAGAC C CT GAACAAT C CTGCAGAGAACAGC 

cccaatcaccgtccttacacggcctctgatttcatagaagggatctaccgaacagaaagggacaaagggacattgtat 
gagct caccttcaaaggggac cac aaacac gaatt caaac ggct cat cttattt c gaccatt cagcc c cat catgaaa 
gtgaaaaatgaaaagctcaacatggccaacacgcttatcaatgttatcgtgcctctagcaaaaagggtggacaagttc 
cggcagttcatgcagaatttcagggagatgtgcattgagcaggatgggagagtccatctcactgttgtttactttggg 
aaagaagaaataaat gaagtcaaagg aat actt gaaaacactt c caaagct g ccaacttcaggaactttac cttcatc 
ffl cagctgaatggagaattttctcggggaaagggacttgatgttggagcccgcttctggaagggaagcaacgtccttctc 
tttttctgtgatgtggac^tctacttcacatctgaattcctcaatacgtgtaggctgaatacac^gccagggaagaag 
M3 gtattttatccagttcttttcagtcagtacaatcctggcataatatacggccaccatgatgcagtccctcccttggaa 
cagcagctggtcataaagaaggaaactggattttggagagactttggatttgggatgacgtgtcagtatcggtcagac 
ttcatcaatataggtgggtttgatctggacatcaaaggctggggcggagaggatgtgcacctttatcgcaagtatctc 
« cacagcaacctcatagtggtacggacgcctgtgcgaggactcttccacctctggcatgagaagcgctgcatggacgag 
ctgacccccgagcagtacaagatgtgcatgcagtccaaggccatgaacgaggcatcccacggccagctgggcatgctg 
gt gtt caggcacgagatagaggct cacctt cg caaacaga^cagaagacaagt agcaaaaaaacatgtaact cccaga. 
; u 1 gaaggattgtgggagacactttttctttccttttgcaattactgaaagtggctgcaacagagaaaagacttccataaa 
'pj ggac gacaaaagaatt g gactgatgggtc agagat gagaaagcctcc gat t tctctct gtt gggct ttttacaacaga 

aatcaaaatctccgctttgcctgcaaaagtaacccagttgcaccctgtgaagtgtctgacaaaggcagaatgcttgtg 
agat tataag cctaat g gt gt ggaggttt t gat ggt gtt t acaatacact gagacctgttgttttgtgtgct cattga 
fhj aatat t cat gat ttaagagcagtttt gtaaaaaatt c att agcat gaaaggcaagcatattt ct c ct catat gaat ga 

gcctat cagcagggct ctagtt tctag gaat g ctaaaatat cagaaggcaggagaggagat aggcttattat gat act 
agtgagtacattaagtaaaataaaatggaccagaaaagaaaagaaaccataaatatcgtgtcatattttccccaagat 
taaccaaaaataatctgcttatctttttggttgtccttttaactgtctccgtttttttcttttatttaaaaatgcact 
ttttttcccttgtgagttatagtctgcttatttaattaccactttgcaagccttacaagagagcacaagttggcctac 

ATTTTTATATTTTTTAAGAAGATACTTTGAGATGCATTATGAGAACTTTCAGTTCAAAGCATCAAATTGATGCCATAT 
CCAAGGACATGCCAAAT GCT GATT CT GTCAG GCACT GAAT GTCAG G CATT GAGACATAGGGAAGGAAT GGTTTGTACT 
AATACAGACGTACAGATACTTT CTCT GAAGAGT ATTTT C GAAGAGGAGCAACTGAACACT GGAGGAAAAGAAAAT GAC 
ACTTTCTGCTTTACAGAAAAGGAAACTCATTCAGACTGGTGATATCGTGATGTACCTAAAAGTCAGAAACCACATTTT 
CTCCTCAGAAGTAGGGACCGCTTTCTTACCTGTTTAAATAAACCAAAGTATACCGTGTGAACCAAACAATCTCTTTTC 
AAAACAGGGTGCTCCTCCTGGCTTCTGGCTTCCATAAGAAGAAATGGAGAAAAATATATATATATATATATATATTGT 
GAAAGATCAATCCAT C TGC CAGAAT C TAGT G G GAT GGAAGTTTTT GCTACATGT TAT CCAC C CCAGGC CAGGTGGAAG 
TAAC T GAATTAT TTTTTAAATTAAGCAGTT CTACT CAAT CACCAAG AT GCTTCT GAAAATT GCAT TTTATTACCAT T T 
CAAACTATT TTTTAAAAATAAATACAGTTAACATAGAGTG GTTT CTTCATT CATGTGAAAAT TATT AGCCAGCAC CAG 
ATGCATGAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTGCTCACAGTAAACTCATTGTTTAAAAGCTTCAAGAAC 
ATTCAAGCT GTT GGT GTGTTAAAaAATGCATTGTATTGATTTGTACTGGTAGTTTATGAAATTTAATTAAAACACAGG 
CCATGAATGGAAGGT GGTATTGCACAGCTAATAAAATATGATTTGTGGATATGAA 
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FIGURE 72 



I: s:? 
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MMMVRRGL LAW I S RVWLL VL L C C AI S VL YML AC T P KGDE E QL AL P RANS P T GKEG YQAVLQE WE 
EQHRNYVSSLKRQIAQLKSELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEWAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFK^^ 

TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVELTVVYFGKEEINEVKGILENTSKAM 

T FI QLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD I Y FTSE FLNTCRLNTQPGKK7F YPVL FSQY 

NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 

KYLHSNLIVVRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHL 

RKQKQKTS SKKT 

Important features: 



i t l Signal peptide: 

H! amino acids 1-27 

■ n 

N-glycosylation sites . 

amino acids 315-319, 324-328 

r ?: 

1 

|y N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 



Amidation site. 

amino acids 377-381 
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FIGURE 73 



GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 
GAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCrCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGCCCAGTCGCCCATCGATATTCAGACAGACAGTGTGACAT TT GACCCT GATTT GCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
TGAAGCCACATTTGCAGAGCTCCACATTGTACATTATGACTCTGATTCCTATGACAGCTTGAGTG 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AAT AT AGC T T ATGAACACAT TC T GAG T CAC T TGC AT GAAGTCAGGCATAAAG AT C AGAAGAC CT C 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
ATTTCAATGGAACAGCTGGAAAAGCTTCAGGGGACATTGTTCTCCACAGAAGAGGAGCCCTCTAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCrTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
C CGAAAGAGTGTGGTC T TCACG T CAGCACAAGCCACGACT GAG GC ATAA AT T CC T TCT GAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAG TAAGCAGATGTCC T CCT TCCCC T GGACAT CT CTTAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGC CT TCAGCCTCT CCAAACAT GTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
C TCCT TAGGATAAAGAG TTGCTGT TGAAGTTGTATATT TT TGATCAATATAT TTGGAAAT TAAAG 
TTTCTGACTTT 
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FIGURE 74 



MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTYQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFTSAQATTEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 



ffl amino acids 291-310 



N-glycosylation site. 



■f*\ amino acids 213-216 

in 

'Q Eukaryo tic- type carbonic anhydrases proteins 



amino acids 197-245, 104-140, 22-69 
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FIGURE 75 



TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 
ATGGAC TCCACCAGAGGGTCT TCAAGGAGTTAAAGTTACTTACACTGTGCAGTATTTCA^ CACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTTTCCATGCAACAAATATACTCCAATCTGA 
AG TATAACGTG TCTGTGTT GAAT AC TAAAT CAAACAGAACG TGGT CCCAGT G T G T GACCAAC CAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 

, AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGAGTTTGAAAGATCAATCAT 

fa* 

h\ CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
: p T C T G T GAT GGGCTATTC CAT C TAC C G AT ATAT C CAC G T T GG CAAAGAGAAAC AC CC AGCAAAT T T 

If! GAT T T TGAT T T ATGG AAATGAAT T T GACAAAAGATT CT T T GT GC C T GC TGAAAAAAT CG T GAT T A 

ACTTTATCACCCTCAATATCTCGGATGATTCTAAAATTTCTCATCAGGATATGAGTTTACTGGGA 
AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
GGAAGAGGAGG T GAAACAT T TAGGGT ATGC T T CGCAT T TGAT GGAAAT T TT T TGTGAC TC T GAAG 
% AAAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAGCAGAACAATACCCCCGGATAAA 
13 AGAGT C AT T GAATATGAATATGAT GT CAGAACCACT GACATT TGTGCGGGGCCT GAAGAGCAGGA 

GC T CAGT TT GCAGGAGG AGGT GT CC AC ACAAGGAAC AT TAT T GGAGT CGC AGGCAGCGT T GGC AG 
TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 
flf TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAATGAAACCTATCTCATGCAATTCATGGAGGAATGGGGGTTATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTT TGCC AGACT GGGTGCAGAAT T TAT TCAGGT GGG T G T 



yj 
w 



m 
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FIGURE 76 



MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYNVSVLNTKSNRTWS QCVTNHTLVL TWLE PNT LYC VHVES FVPGPPRRAQPSEKQCARTLK 
DQSSE FKAKI I FWYVLP I S IT VFLFSVMGYS I YRY IHVGKEKHPANL I L I YGNEFDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQ'TGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 



|f| amino acids 1-28 

w 

*M Transmembrane domain : 

^ amino acids 140-163 

l M ■ 

T% N-glycosylation sites. 

Hi amino acids 71-74, 80-83, 89-92, 204-207, 423-426 

■W 

.pi 

m 
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FIGURE 77 



GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
T CG T GGAG T T C C AC AT GAC GAC T GAG GC C C AAGC CAC CAT C C G CAT GG ACACCAG T GC AAG T GGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
S TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
||| CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
III CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 



III 

□ 



TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
TGATCAATGAAAAGGCTGCAGATAAGCTGGGATCTACCCAGATCGTGAAGATCCTAACTCAGGAC 
ACT CCCGAGT TT TT TATAGACCAAGGCCATGCCAAGGT GGCC CAACTGATCGTGC TGGAAG T GT T 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 



m TTTACACCAAAGGTGACCAACTTATACTCAACTTGAATAACATCAGCTCTGATCGGATCCAGCTG 
ATGAAC TC TGGGATTGGCTGGT TCCAACCTGATGT T CT GAAAAACAT CAT CAC TGAGATCAT CCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 



MAGPWT FT LLCGLLAATLI QATLS PTAVL I LGPKVI KEKLTQELKDHNATS I LQQL PLLS AMREK 
PAGG I PVL GS LVNT VLKHI IWLKVITANILQLQVKPSANDQELLA/KIPLDMVAGFNTPLVKT IVE 
FHMT TEAQAT IRMDTSAS GPTRLVLS DCAT SHGSLRI QLLYKLS FLWALAKQVMNLL VPS LPNL 
VKNQLCP V IEAS FNGMYADLLQLVKVP I S L S I DRLE FDLLYPAIKGDT I QL YLGAKLLDS QGKVT 
KWFNNSAAS L XMPTLDNI PFSL I VSQDWK7\AVAAVLSPEEFHVLLDS VLPESAHRLKS S I GLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQL I LNLNNI S SDRI QLMN S GI GW FQPD VLKNI I TE I IHS I LL PNQNGKLRS GVPVS LVKALG 
FEAAE SSL TKDAL VL T PAS LWKPS S PVS Q 



01 



Important features of the protein: 
Signal peptide: 



amino acids 1-21 



N~glycosylation sites. 




amino acids 48-51, 264-267, 401-404 



Glycosaminoglycan attachment site. 



amino acids 412-415 



LBP / BPI / CETP family proteins. 



amino acids 407-457 
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FIGURE 7Q 



GAGAGAAGTCAGCC TGGCAGAGAGACTC TGAAATGAGGGAT TAGAGGTGTTCAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCT GGAT GGAATGT GCCACACACAGCACAGGCATCACCCAGTGTGACATC T ATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
G AGCCAAAGACAGAGT GGCGGTAGC AGG T GGAGTC T T T T T CAT CCT T GGAGGC C T CC TGGGATT C 
|3 ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 



? |; fl GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 

* GTCAGAAGGT GC TGC T GAG GATAGAC T GAC TT TGGCCAT T GGATTGAGCAAAGGCAGAAATGGGG 

III 

pi TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 



GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 



||| GGACT CAGAGGAT CCC T T TGCCC TC T GGT TTACC TGGGAC T CGATCCCCAAACC CACT AAT CACA 

0 TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 



GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GAC TAAT T TGTGCATGAACTGAAAT AAAACCATCCTACGGTATCCAGGGAACAGAAAGCAGGATG 
CAGGATGGGAGGAC AGGAAGGCAGCC T GGGACAT TT AAAAAAATA 
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FIGURE 80 



MASLGLQLVGYI LGLLGLLGTLVAMLLPS WKTS S YVGAS I VTAVGFSKGLWMECATHS TG I TQC D 
I Y STLLGLPADI QAAQAMMVT S SAI S S LAC 1 1 S WGMRCTVFCQE SRAKDRVAVAGGVFFI LGGL 
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 



■■■■■ - 



Transmembrane domains: 



m 
ha 

h8 



amino acids 82-102, 117-140, 163-182 



N-glyeosylation site. 



amino acids 190-193 




PMP-22 / EMP / MP20 family proteins. 



amino acids 4 6-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCeGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCC'CAAGTGTGTGGAACCTCACAC 
TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
ACTGCCAGAC TCCT GCTGCCAAGCC TGCAAAGATG AGGCAAGTGAGCAATCGGAT GAAGAGGACA 
GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
CAGACCCAAGGGAGCAGGCAGCACAACTGTCAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCT 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG 
GTGAAGT ACC TGGCCCAAGGC GACACAG CC AGAAT C T T CCAC T T GAC TC AG AT GAAGAAAGT CAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACG TC T TCC TAGC C CAGACC C T GG AGC T GAAGGT CACGGCCAGT CC AGACAAAGTGACCAAG 
AC ATAACAAAGACCTAACAGT T GCAGATATGAGC T GT ATAAT T GT TGT TAT T AT ATAT TAATAAA 
TAAGAAGTTGCATTACCCTCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 

MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHMGTMYOHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 



Signal peptide: 

?l% amino acids 1-25 

m 

m 

m 

yj 

•.n 

m 



ztti.r 

m 
m 
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FIGURE 8s 

GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCAT6GCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CCAGTTCTCTAGCGTGGTCCACCTCTACAGGGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATC 
AAGGCAGGACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATTACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
ACAGGT GT CAGC ACTGGGCT CAGTTC CT CTCATTT C CAT C ACGGGATAT GTT GAT AGAGACAT CC AGCT AC 



p% TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 

?*\ ACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGAA 

fft CGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAG 

Ui GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 

W l TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 

, n 

GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 



"?* f AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 
CCTCACTCTGAGAAGAGATTTACAAGGAAGAGTGTGGTGGCTTCTCAGAGTTTCCAAGCAGGGAAACATTA 
IP CTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGA 
pi AGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCCTCAGACTGAATGGAGAACATTTGTATTTC 
fj| ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
gj TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
fll AAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAGCAAAATGGAACTCCCATAGTCATCTGC 
CCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAGAGACAAGCAACAG 
TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAAT GTAG GATGAATCACATCCCACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATG AAGGGGGACT GGCCT GTCCACAT GGGAGTC AGGT GT C ATG GCT GCCCT GAGCTGGGAGGGAAG AAGG 
CTGACATTACATTTAGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAG 
AACCGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAGTAGACAAGGAATGTGAATAATGCTTAG 
ATCT T ATT GATGAC AGAGT GTATCCTAATGGTTTGT T CAT T ATATT ACACTT TC AGTAAAAAAA 
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FIGURE 84 



MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLS PKTNAEAMEVRFFRGQFS SWH 
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKS WAS QS FQAGKH YWE VDGGHNKRWRVG" /CRDDVDRRKE Y VT L S PDHG Y WVLRLNGEH L Y FT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 



O 

m 
m 

|f| Transmembrane domain: 

>S amino acids 239-255 



m 

m 
o 
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FIGURE 85 



AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGAC ATG CTGCTGCTGCTGCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAG TAGT TCAT GGC T AC T GG T T C C G GGAAGGGGCCAATACAG ACCAGGAT GCT CC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACC AAGAAT TGCAC C CT GAGCAT CAGAGAT GC CAGAAGAAGT GAT GCGGGGAGA 
T ACT T CT T T CGT AT GGAGAAAGGAAGT ATAAAATGGAAT T ATAAACATCAC CGGC TC T CT G T GAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 



r\ 

M 

W GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 

ftt 

^" GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 

bl 

W ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
fe g GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
fell GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 

81 

pi 



GAGGCCTGAGCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACT CAGGGGGTGGTCGGGGGAGCTGGAG 



ill 

a 

fn CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 



IJ--J TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
?,W AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCT TGGGACTCGCGGGGACAGGAGGCCACTGAC ACCGAGTACT CGGAGAT CAAGAT CCACAG 
ATGAGAAACTGCAGAGACTCACCCTGATTGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 
GAGGCTGATTCTTGTAGAATTAACAGCCCTCAACGTGATGAGCTATGATAACACTATGAATTATG 
TGCAGAGTGAAAAGCACACAGGCTTTAGAGTCAAAGTATCTGAAACCTGAATCCACACTGTGCCC 
T C C C T T T TAT T T T T T T AAC T AAAAG AC AGACAAAT T CC T A 
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FIGURE 86 



MLLLLLPLIiWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFFRMEKGSIKWNY 
KHHRLSVNVTALTHRPNILIPGTLESG.CPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQMLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVS LQSKATSGVTQGWGGAGATALVFLS FCVI FVWRSCRKKSARPAAGVGDTG I E 
DANAVRGS AS QGPL TE PWAEDS P PDQP P PAS ARS S VGEGELQYASLS FQMVKPWDSRGQEAT DTE 
YSEIKIHR 




Signal peptide: 



amino acids 1-15 



Transmembrane domain: 



amino acids 351-370 
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FIGURE 87 



AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
C CAAGGAAAG.TGCAGCTGAGACT CAGACAAGATTACA ATGA ACCAACTCAGCT TCC TGCT GT TT C 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGMTGGACCTGT 
T CTTCG TC TCCATC TCTGCCCAGAAGCTGCAAGGAAAT CAAAGACGAAT GT CCTAGTGCAT T TGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGAT CGC TGGT CCAGT CAGCAGGG CAGCAAAGCAGAC T ACC CAGAGGGGGACGGCAAC TGGGC 
CAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCGATGACTACAAGAACCCTGGCTACT 
ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 
GT T T GGCATCTACCAGAAATATC CAG T GAAATATG GAGAAGGAAAGT G T TGGAC T GACAACGGC C 
C GG T GAT CCCTGTGGTC TATGAT T T T GGCGACGC CC AGAAAACAGCAT C T TAT TAC T CAC CC T AT 
GGCCAG C GGGAAT T CAC T GCGGGAT T T GT T CAG TT CAGGGT AT T TAAT AACGAGAGAGCAGC CAA 
J.Q CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
n GATAC TT T C CAGAGGCCAG TCCC CAGCAGTGTGGAG AT T T T T C T GGT T TT GAT T GGAGT GGAT AT 

GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
TTGAGAGTTTTGTGGGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAA 
CAACTTACCCAGTAGCTAGAATGTTAATGGCAGAAGAGAAAACAATAAATCATATTGACTCAAGA 
AAAAAA 
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m 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQT FCDMT S GGGGWTLVAS VHENDMRGKC TVGDRWS S QQGSKAD YPEGDGNWANYNT FGS AE AAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR. 

Important features : 
Signal peptide: 




N-glycosylation site. 



Glycosaminoglycan attachment sites. 



amino acids 1-16 



amino acids 163-167 




amino acids 74-78, 289-293 



N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 



o 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCC ATG GGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTrGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTG2^AAAAGAAACCCTTCTGATTACCTTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 



m TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 

hi l 

111 TTAAGAC TTATATACAGT TT TAGGGGACAAT T AAAAAAAAAAAA 

m 

.Hi 

» * 

* 

ft'R% 

M 

W! 



M 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFL S GVSMFNS T Q S L I S I GAHCS AS VAL S FF I FE R WE C TT YW Y I FVFCS AL P AVTEMAL FY 
TVFGLKKKPF 



Transmembrane domain: 

amino acids 12-28 (type II) , 51-66, 107-124 



r\ 
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FIGURE qi 



CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATGCAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCGAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCC T GAACC T GCT CGT CACAGGGAAGAT TGT GGAC CAT GGCAATGGGAC CT T CAGCG TC C ACT T 
CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
AGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 
GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 



HI CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 

Ul TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 

l?l CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 

■J% 

■ ? !J T G GAAGGACAGGCCTGCCCAT GCAGGAGAC CAT C T GGACACCGGGCAGGGAAGGGGT TGGGC C T C 
AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 

% 

m AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 

If! AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 



TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CCT TGGT TC TTGCCAT CCTGAGGAAAGATAGCAACAGGGAGGGGGAGAT TT CATCAGTGT GGACA 



o 
a 

S3 ■ 

p | GCCT GT CAAC TT AGGATGGATGGC T GAGAGGG C T T C C TAGGAGCCAGTCAGCAGGGTGGGGT GGG 

GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 



MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGI LGQPPNRPNHS PPPS AKVKKI FGWGDFYSNIKTVALNLLVTGKIVDHGNGT F 
SVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 




N-glycosylation sites. 



2-oxo acid dehydrogenases acyl transferase 



amino acids 62-65, 127-130, 137-140, 143-146 



amino acids 61-71 




IB 
O 
111 
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FIGURE Q3 



m 



CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATGTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
AT GT T CCGAT T T G CAT AT TAT AAAC T C T TAAAAAAAGC CAG T GAAGG T T T GAAG AG TAT AAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCGTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACGCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
AACCTC AGGGAACCAGCACTTCC CAAACCGCAGACT AC AT C T T TAGAGGAAGCACAACTGTGCCT 



W 

m 

j.H TTTTCTGAAAR.TCCCTTTTTCTGGTGGAATTGAGftAaGAM.TAaAACTATGCAGATA 

» 

P 

If! 

a 

CO 
ft 

fll 
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FIGURE Qd 



MTAAVFFGCAFIAFGPALALYVFTIAIEPLRI I FLIAGAFFWLVSLLI SSLWFMARVI IDNKDG 
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRL.LAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQT FI S S Y YG INLASAF 1 1 LVXMGTWAFLAAGGS CRS LKLCLLCQDKN FLLYNQRS R 



m 
ci 

fl 
hi 



Important features of the protein: 
Signal peptide: 

amino acids 1-19 



O 

m 
w 

Glycosaminoglycan attachment site 

ill amino acids 120-123 



Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 



Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 



AAT T T T TCACCAGAGT AAAC T T GAGAAACCAAC TGGACC T T GAGTAT T GT AC AT T TTGCCTC GT G 
G AC CC AAAGG T AGCAATCT GAAACATGAGGAG TACGAT TC TAC TGTTTTGT CT TC TAGGAT CAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAATCAGGTCT TTCCT TC TT TAAGTCTGATACCATTAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
C CAGACCCACCCAT T GACCCTGGGAGGGT TGAAT GT ACAAC AGCAAC T GCACCCACATG T G T TAC 
CAAT T T T TGT CACACAAC T TGGAGC CCAGGGCAC TAT C C T AAGC T CAGA.GGAAT T GCCACAAAT C 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 

GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 

Is?* 

| s | GTAAGCTGTTTCAAATTTTTTCAACT7VAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
ffl AT T GAT TATATT ATGGAATAGATTGAGACACATTGGATAGTCTTAGAAGAAAT T AAT TCTTAAT T 

Ui TACCTGAAAATATTCTTGAAATTTCAGAAAATATGTTCTATGTAGAGAATCCCAACTTTTAAAAA 



o 

'-ST 



fll 



C AATAAT TCAATG GATAAAT CTGTCTTT GAAAT ATAAC AT TAT GC T GCCT GGATGATAT GCATAT 

TAAAACATATTTGGAAAACTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE q6 



If! 

W 
pi 



. F4 



if! 
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a! 
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PS 5 ' 



MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTS QAGAHPDVQDGS L PAGGAGVNPATQGT PAGRL P T PS G 
T DDDFAVTTPAGI QRS THAI EEAT TE SANG I Q 

Signal peptide: 

amino acids 1-16 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACC ATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTA.CCAGCTACAGGTCACCCTGGAGAT GCAGGATGGACAT GT CT TGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACT CGGATCTTCGAT T CCACATCCTGAGCCAGGCT CCAGCCCAGC 
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTGAGCCCCAAGGGGAGC 
& ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
5 CCCTAGAGCCTATGCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
i? CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
l! TGCAGAGGGAAACCTCTACGTGACCAGAGAGCTGGACAGAGAAGCCCAGGCTGAGTACCTGCTCC 
|| AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
ATT CCCACGTTGTGTATCAGC TCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTT C 
i|I CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
p CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
f-| TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
; F « ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
j|| CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 
Is! GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 
£0 GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
Q AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGGCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
ATTTTCACCCACTGGACCATGTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TAT GGACTGCCCAT GGGAGTGC TCCAAATG TCAGGGTGTT TGCCCAATAAT AAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 



rii 



App JD=10063699 Page 251 of 322 



P 



u 

m 
o 



y 



FIGURE q8 



MVPAWLWLLCVSVPQALPKAQPAELSVSVPENYGGNFPLYLTKLPLPRE.GAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDRESQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
S TS LDHALERT YQLLVQVKDMGDQAS GHQATAT VEVS 1 1 ES TWVS I*E P I HLAENLKVL YPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEWWVQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPISAPAGSFLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 



Q QYLCTPRQDHGLIVSGPSKDPDIASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REH 1 1 P VWSHNAQMWQLLVRV I VCRCNVEGQCMRKVGRMKGMP TKLSAVG I LVGTLVAI GI FL I 
LI FTHWTMSRKKDPDQPADSVPLKATV 



m 

ill 

m 

ji| Signal peptide: 



amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE qq 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AG T CCAAGAT TC TT CC CAGGAAC AC AAAC G TAGGAG ACCCACGCT CC TGGAAGC AC CAGC CT TTA 
TCTCTTCACCTTCAAGTCCCCTT7CTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATC TAGGAC C CAGGC AT CT TGCT T T CC AGC CACAAAGAG AC AGAT6AAGATGCAGAAAGGAAAT G 
T TCTCCTTATGTTT GGTCTACTATT GCAT T TAGAAGC TGCAACAAAT TCCAATGAGACTAGCACC 
T CT GCCAACAC T GGAT CCAGTGT GAT C T CCAGTGGAGCCAGCACAGCCACCAACT C T GGGTC CAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGTCACCAACTCTGAGTTCCATACAACCTCCAGTGGGATCAGCACAGCCACCAACTCTGAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
GGCCAGCACAGCCACCAACTCTGAGTCCAGCACACCCTCCAGTGGGGCCAGCACAGTCACCAACT 
C TGGGTCCAG T GTGACC TCCAGT GGAGCCAGC AC T GCC AC CAAC TC T GAG T CCAGCACAGTGT C C 
AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 
CAACT C T GAC T C CAGC ACAACC T C C AGTGGGGC TAGCACAGC C ACCAAC T C TGAGTCCAGCACAA 
H CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
«i GCCACCAACTCT GAGT CCAGCACAACCTCCAGTGGGGCCAGCACAGC CACCAAC TC T GAGTCCAG 

5"J AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
U]' GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG 
i|| TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
ffl GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC 
AGT GGGGCCAACACAGCCACCAACTCTGAGTCCAGTAC GACCTCCAGTGGGGCCAACACAGCCAC 



W CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
W CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
Q GCCACCAACTCTGACTCCAGCACAACCTCCAGTGAGGCCAGCACAGCCACCAACTCTGAGTCTAG 
CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 



ru 



ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
W ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
SI GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
F! TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GTGGAGTCCTAACTGGTTCTGGAGGAGACCAGTATCATCGATAGCCATGGAGATGAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AAT CT TGAAGAAG GTAT TCC T CACC T TTCTTGCCTT TACCAGACACT GGAAAGAGAATAC TATAT 
T G C T C A T T TAG C TAAG AAAT AAAT AC AT C T CAT C T AACAC ACAC G AC AAAG AGAAG CTGTGCTTG 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CAT CTGGCAT T CAAAATCTCCACAGTAAAAT C CAAAGACCT CAAAAAAAAAAAAAAAZW^AAAAA 
AAAAAA&AA&AAAAAAAAAAAAAAAAAAAAA 



AppJD=10063699 



Page 253 of 322 



FIGURE 100 
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MKMQKGNVLLM FGL LLHLEAATNSNE T S TSANT G S S VI S S GAS TATN5 GS S VTS S G VS T AT I S GS 
S VT S NG VS I VTNS E FHT T S S G I S T ATNS EES TAB SGIS I ATNS E S S T T S S GAS T ATNS ES S T P S S 
GAS T VTKS GS S VTS S GAS T ATNS E S S T VS SRAS T ATNSE S S TL S 5 GAS T ATNS D S S T T S S GAS T A 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TKSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 



O AMEMSGRNSGP 



Signal peptide: 

amino acids 1-20 

Transmembrane domain : 

amino acids 510-532 
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FIGURE 101 

GGCCGGAC'GCCTCCGCGTTACGGGATGMTTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGC TGGAAC TAT GAGGCTGAAAAACAGAGTGGGTACT C TC TTCT GGGAAGCTGGCAACAAAT 
GGATGATG T GAT ATATGCAT TCC AGGGGAAGGGAAAT TGTGGTGC T T C TGAACCCAT GG T CAAT T 
AAC G AGGCAGT T T C TAG C T AC T GC ACGT AC TTCATAAAGC AGGAC T CTAAAAGC T T T G GAAT CAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
T CATGC TGAGTC CCT T T T TACC TT TGATGTTTGTAAACCCATCT TGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GAT T AT AACT GGGGAT GC AT TT GT TCC TGGAGAAAGAAG T G T CAT T ATC AT GAACCATCGGACAA 
GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
CT T CAT TCATAGGAAATGGAAGGAT GACAAGAGC CAT T T C GAAGACAT GAT T GAT TAC T T T T G T G 
AT AT T CACGAACCACT T CAAC TCCT CAT AT T CCCAGAAGGGACT GAT C T C ACAGAAAACAGC AAG 
T CT CGAAG TAAT GC AT T T GC TGAAAAAAAT GGAC T T CAGAAATATGAAT AT GT T TTACATCCAAG 
AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
CCT TCAAC T C T GGT GCCACAAACGGTGGGAAGAGAAAGAAGAGAGGC T GCGT T CC TT CTATC AAG 
GGGAGAAGAAT T T T TAT T T T ACCGGACAGAGTGTCATTCCACCTTGCAAGTCTGAACTCAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 
GAATATTTGGTGGACTGGAGATCATAGAACTTGCA.TGTTACCGACTTTTACACAAACAGCCACAT 
TTAAATTCAAAGAAAAATGAGTAAGAT TATAAGGT TTGCCATGTGAAAACCTAGAGCATAT T T T G 
GAAATGT T C TAAAC C T T T C T AAGC T CAGATGCAT TT T TGC ATG AC TAT GT CGAAT AT T T CT T ACT 
GCCATCATTATTTGTTAAAGATATTTTGCACTTAATTTTGTGGGAAAAATATTGCTACAATTTTT 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
G G GCC AGAAT AT TAT T AAAC AAT CAT C AGG CT T T TAAA 
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FIGURE 102 



MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITYA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVIPPCKSELRVLWKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG 
LEI IELACYRLLHKQPHLNSKKNE 



*4 Important features of the protein: 



Signal peptide: 

amino acids 1-22 



biz* 

m 
w 

f|| Transmembrane domains : 

jtQ amino acids 44-63, 90-108, 354-377 

o 



O 

m 
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FIGURE 1Q3 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCTCTACAGAGAT GGGGAAGACTGGGAATCTAAGCAGATGCCACAG TATCGAGGGAGAACTGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GT T ACTCTGCCT G T C C TCAGGC T G GT T C C C CCAGCCC ACAJ3CCAAGT GGAAAGGTC CACAAGGAC 
AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC 
ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
#| TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
'JJ AAATCCAAAGGGAAAAT CCAGGCG GAACT GGAC T GGAGAAGAAAGCACGGACAGGC AGAAT T GAG 

fl AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
ill TTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGA 
01 TT T ACAAGGAAGAGT GTGG TGGCTTC TCAGGG T T T CCAAGCAGGGAGAC ATT AC TGGGAGGTGGA 

\ Vi CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
>n ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 
M*. ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
f m GGAC TATGAGGGTGGGACCAT CT CC T T C T TCAATACAAAT GACCAGT CC CT TAT T TATACCCTGC 

|?J T GACATGT CAG T T T GAAGGC T TGT TGAGACCC TAT AT CCAGCATGCGAT GTATGACGAGGAAAAG 

j|| GGGACTCCCATATTCATATGTCCAGTGTCCTGGGG ATGA GACAGAGAAGACCCTGCTTAAAGGGC 
CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT 



m 



CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
P T CAGAAGCCATGGCT GC CC T GAAGT G GGGACGGAAT AGAC TCACAT TAGG T T TAGT T TGT GAAAA 

jjj CT CCAT CC AGCT AAGC GAT C T T GAACAAGTC ACAAC C T CC CAGGC TC C T CAT T T GC TAGT CACGG 

ACAGTGATTCCTGCCT CACAGGT GAAGAT TAAAGAGACAACGAATGT GAAT CATGCT TGCAGGT T 
T GAGGGCACAG T G T T T GC TAATGAT G T G T T T T TATAT TAT ACAT T T T CCCACCAT AAACT CT GT T 
TGCTTATTCCACATTAATTTACTTTTCTCTATACCAAATCACCCATGGAATAGTTATTGAACACC 
T G C T T TGT GAG GC T CAAAG AAT AAAGAG GAGG TAG GAT T T T T C AC T GAT T C T ATAAGCCC AGCAT 
T AC C TGATACCAAAAC CAGGC AAAGAAAACAGAAGAAGAGGAAGGAAAAC T ACAGG TC CATATCC 
C T CAT TAACACAGAC AC AAAAAT TC T AAATAAAATT TT AACAAAT TAAAC T AAAC AATATAT T T A 
AAGATGATATATAACTACTCAGTGTGGTTTGTCCCACAAATGCAGAGTTGGTTTAATATTTAAAT 
ATCAACCAGTGTAATTCAGCACATTAATA2^GTAAAAAAGAAAACCATAA71AAAAAAAAAAAA 
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FIGURE lOA 
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MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHAVVH 
LYRDGEDWESKGMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
IVQENAGSILCSIHLAEQSHEVESKVLIGETFFQPSPWRLAS ILLGLLCGALCGWMGMI IVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQGFQAGRHYWEVDVGQWGWWGVCRDDVDRGKNWTLSPNNGYWLRLTTEHLYFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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FIGURE log 



CCTT CAC AGGACTC T T CATTGCT GGT TGGC AATG AT GT AT CGGCCAGATGTGGTG AGGGC TAGG AAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CTCAC T GTTCATTAT GT GAG AT AT AATC AAAAG AAG ACCT AC AATT ACT AT AGC ACATTGT C ATTTAC AAC 
TGACAAACTATATGCTGAGTTTGGCAGAGAGGCTTCTAACAATTTTACAGAAATGAGCCAGAGACTTGAAT 
CAATGGTGAAAAATGCATTTTATAAATCTCCATTAAGGGAAGAATTTGTCAAGTCTCAGGTTATCAAGTTC 
AGTCAACAGAAGCATGGAGTGTTGGCTCATATGCTGTTGATTTGTAGATTTCACTCTACTGAGGATCCTGA 
AACT GT AG ATAAAATT GT TC AACT TGTT TTACATGAAAAGCT GCAAGATGCT GT AGGACC CC CT AAAGT AG 
ATCCTCACTCAGTTAAAATTAAAAAAATCAACAAGACAGAAACAGACAGCTATCTAAACCATTGCTGCGGA 
ACACGAAGAAGT AAAACT CT AGGT CAGAGTC T CAGGATCGT T GG TGGGAC AGAAGT AGAAGAGGGTG AAT G 
GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
Q TTGTGAGTGCTGCTCACTGTTTTACAACATATAAGAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACA 
|f! ATAAAACCTTCGAAAATGAAACGGGGTCTCCGGAGAATAATTGTCCATGAAAAATACAAACACCCATCACA 
p| TGACTATGATATTTCTCTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTC 
TCCCTGATGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAAAT 
GATGGT T ACAGTCAAAAT C ATCTT CGAC AAGC ACAGGT GACTCT CATAGACGCTACAAC T TGC AATGAAC C 
TCAAGCTTACAATGACGCCATAACTCCTAGAATGTTATGTGCTGGCTCCTTAGAAGGAAAAACAGATGCAT 

n 

m GCCAGGGTGACTCTGGAGGACCACTGGTTAGTTCAGATGCTAGAGATATCTGGTACCTTGCTGGAATAGTG 
Isf AGCT GG GG AG ATG AAT GT GCGAAACC CAAC AAGCCT GGTGTT TAT ACT AGAGTT ACGGCC TT GC GGGACT G 



a 



i.3 



GATTACTTCAARAflCTGGTATCTARGAGACRAARGCCTCATGGAACRGRTRACRTTTTTTTTTGTTTTTTG 



m 

m GGTGT GGAGGCC AT T T TT AG AG AT AC AGAATT GG AGAAGACTTGCAAAACAGCT AG ATTT GAC TGATCT C A 

?sr>? 

g| ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
ffj GATCAACTCTGTCATCTGTGAGCAATAGTTGAAACTTTATGTACATAGAGAAATAGATAATACAATATTAC 
ATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGACTTGTTGACATAAATTTGTAAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
TCAAGGTGCAGAACAAGGAGTGAAAGAAAATATAAGAAGAAAAAAATCCCCTACATTTTATTGGCACAGAA 
AAGT ATT AGGTGTT TT TC TT AGTGGAAT ATTAGAA&TGAT CAT ATT CAT TAT GAAAGGTC AAGC AAAG AC A 
GCAGAATACCAATCACTTCATCATTTAGGAAGTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 
ATATATCCTTATTTTCATTTCCAAACAACTACTATGATAAATGTGAAGAAGATTCTGTTTTTTTGTGACCT 
AT AAT AATTATACAAACTTC AT GC AATGT ACT T GT T CT AAGC AAATTAAAGC AAAT AT T TAT TTAACATT G 
TTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 
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FIGURE 106 
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MMYRPD WRARKRVC WE PWV I GLVI FI S L I VLAVCI GLT VHYVRYNQKKTYNYY S TLS FT T DKLY 
AE FGREASNNFTEMS QRLE SMVKNAFYKS PLREE FVKSQV I KFS QQKHGVIAHMLL I CRFHS TE D 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTL I DAT TCNE PQAYNDAI T PRMLCAGS LEGKTDACQGDS GGPLVS S DARD I WYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 



f?-l amino acids 21-40 (type II) 

m 
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FIGURE 107 



a 
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AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGT TCTCACT T T AAC T G CAGAAAC GAGGGTAGAGGAAGCAG T CAT T T TGAC T TACT T T CCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GATATGGTCACTTTGAAAGCCAGGATGACAAATTATGGATTACCTAGATATCGGTGGCTTACTCA 
T GC T TGGAAT TT T T T T CAGAGAG AGT T TAAGTGCTGTGGAG TAGTAT AT T T CAC TGACTGG T TGG 
AAAT GACAGAG AT GGAC T G GCC C C C AG AT TCCTGCTGTGT T AGAGAAT TC C CAGGAT G T T CC AAA 
i> CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTT 
TTTGAGAGGAACCAAACAACTGCAGGTGCTGAGGTTTCTGGGAATCTCCATTGGGGTGACACAAA 
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ACAGACCAAATGAT GTCCTTGAAGAAT GACAACTCT CAGCACC TGTCATGT CCCTCAGTAGAAC T 
|fi GTTGAAACCAAGCCTGTCAAGAATCTTTGAACACACATCCATGGCAAACAGCTTTAATACACACT 
111 TTGAGATGGAGGAGTTATAAAAAGAAATGTCACAGAAGAAAACCACAAACTTGTTTTATTGGACT 
%t% T GT GAAT T TT TGAGTACATAC TATGTGTT TCAGAAATATGTAGAAAT AAAAATGT TGCCATAAAA 

U l TAACACC TAAGCATAT AC TAT T C T ATGCTT T AAAAT GAGGATGGAAAAGTT T CATGTCATAAGTC 

W ACCACCTGGACAATAATTGATGCCCTTAAAATGCTGAAGACAGATGTCATACCCACTGTGTAGCC 
T G T GTAT G AC TT T TACT GAACAC AGT T ATGT T TTGAGGCAGCAT GGT TT GAT TAGCAT T T CCGCA 
n T CC ATGC AAACGAGT C ACATAT GG T GGGAC TGGAG C CAT AG TAAAGGT TG AT T T AC T T CT ACCAA 

%H C TAG T AT ATAAAG T AC TAAT T AAAT G C TAAC A T AG GAAG T T AGAAAAT AC T AAT AACT T T TAT T A 

f;f CTCAGCGATCTATTCTTCTGATGCTAAATAAATTATATATCAGAAAACTTTCAATATTGGTGACT 
ACCTAAATGTGATTTTTGCTGGTTA.CTAAAATATTCTTACCACTTAAAAGAGCAAGCTAACACAT 
P TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
|Q T T C AGT T C T GAT AAT GT T AAGAAT AAC CA.T TAT GAAAAG GAAAAT TTGTCCTG TAT AG CAT CAT T 

pi ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
T GT TAT T TAAAT AC T T AACC ACT AAT T T TGAAAAT T ACCAG TG T GAT AC AT AGGAAT CAT TAT T C 
AGAATGTAGTC TGGT C T T TAGGAAGTA.T TAATAAG AAAAT T T GCACATAACT TAGT TGAT T C AGA 
AAGGAC T TGTAT GCTGT T T TTCT CCCAAATGAAGAC TCTTT T TGACACTAAACACT T T T TAAAAA 
GC T TAT CTTTGCCTTC T CC AAAC AAG AAGC AAT AG T C T C C AAG T CAAT ATAAAT T C T AC AGAAAA 
T AGTGT TCTTTTTCTC CAGAAAAAT GC T T G T GAGAATC AT T AAAACATGTG ACAAT T TAGAGAT T 
C T T T GTT T TATT TC AC TGAT TAATATAC TG TGGCAAAT T AC ACAGATT AT TAAAT T T T T T T ACAA 
GAGTATAGTATATTTATTTGAAATGGGAAAAGTGCATTTTACTGTATTTTGTGTATTTTGTTTAT 
TT CTCAGAATAT GGAAAGAAAAT TAAAATGTGTCAATAAATAT T T TC TAGAGAG TAA 
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FIGURE 108 



MAREDSVKCLRCLLYALNLLFWLMSISVLAVSAWMRDYLNNVLTLTAETRVSEAVILTYFPVVHP 
VMI AVCCFLI IVGMLGYCGTVKRKLLLLAWYFGSLLVI FCVELACGVWTYEQELMVPVQWSDMVT 
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGWYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRI FEHTSMANS FNTHFEMEEL 

Signal peptide: 

amino acids 1-33 



m Transmembrane domains: 

Iff avxino acids 12-35, 57-86, 94-114, 226-248 

lit 

flV 
I 
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FIGURE 1QQ 



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GATAAAGTCTTTCTCGTGAGAGCCTAGAGGCCTTAAAAAAAAAAGTGCTTGAAAGAGAAGGGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCT GCTTACATCAGCCTGAATGAGC 
CCT GGAGGAAC ACT GACC ACC AGTT GGAT GAGT CTCAAGGTCCTCCT CTATGT GAC AACCATGT GAATGGGGAG 
T GGT AC C ACT TC AC G GGC AT GGC GGGAGAT GCC AT GCC T AC CT TCT GC AT ACC AGAAAAC CACT GT GGAAC C C A 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
CTGACC AAGCCCAGC GTCT GCT T CCACGT CT ACT GT GGT C ATT TT T AT GACAT C T GC GAC GAGG ACTGCC AT GG 
C AGCT GCT C AGAT AC C AGCGAGT GCAC ATGC GC TC C AGGAACT GT GC T AGGC CCT GAC AGGCAG AC AT GCT T T G 
ATGAAAAT GAATGT G AGCAAAAC AAC GGT GGCT GC AGT G AGAT CT GT GT GAACC T CAAAAACT C C T AC C GC T GT 
GAGTGT GGGGTT GGC C GT GT GCT AAGAAGT GAT GGC AAGACTT GT GAAGACGT TGAAGGAT GC C AC AAT AACAA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
CTGAGGATAACCACACTT GCCAAGTCCCTGT GTTGT GCAAATCAAAT GCCATT GAAGT G AAC AT C C CC AG GG AG 

m 

¥ CTGGTT GGTGGCCTG GAGCTCTT CCT GACCAACACCTCCT GCC GAGGAGT GTCCAACGGC ACCCATGT CAACAT 

^.W CCTCTTCTCTCTCAAGACATGTGGTACAGTGGTCGATGTGGTGAATGACAAGATTGTGGCCAGCAACCTCGTGA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 
CAT GAGC CGAAAT CAT G GGAT CTTCCCATT C AC TC T GGAGAT C TTC AAGGAC AAT GAGTT TGAAGAGCCTT ACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 
CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
C GGTC T ACAGGGC CAGAC GCT AAC AGGCGGCCCGATCC GC AT C GACT G GGAGGAC TAG T T C GT AGC C AT AC CT C 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
T GGGTT CAGACTT CACACTGT GAGT T C AG AC TCCC AGC ACCAACT C ACT CTGATT CT GGT CCAT T CAGTGGGC A 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
C AC AAAAT CAGAAGCT GGGT AT AATATT T CAAGTT ACAAAC C C T AGAAAAAT T AAAC AGT T AC T GAAATT AT GA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAAT GCAGACTAA 
T T AT AGG GAAT TT GG AAGT GT AT C AAT AAAAC AGT AT AT AATT TT 



m 
la 



% 

if! 
0 
KJ 

hi- 
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FIGURE no 



MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAM'P 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTVVDVVNDKIVASNLVTGLPKQTPGSSGDFI irtsklli pvt 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLJCLRDSLYFGIEPWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 



"ffk Signal peptide: 



amino acids 1-16 



m 
w 

Ol N-glycosylation sites. 

Sfcl amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

b 



Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 



N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
|y 522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 



ZP domain proteins. 

amino acids 431-457 



Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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FIGURE ill 



GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCCTGCACTCGG 
GCCTCCTCCAGCCAGTGCTGACGAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCTGCCTTGGGGTGACAATCTCAGCTCCAGGCTACAGGGAGACCGGGAGGATCACAGAGCCAGC ATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
ATGGAGACCTTCAGAAAGGTGGGGATCCCCATCATCATAGCACTACTGAGCCTGGCGAGTATCATCATTGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGCAGCTGTGTGACGGAGAGCTGGACTGTCCCTTGGGGGAGGACGAGGAGCACTGTGTCAAGAGCTTC 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
, ( ' AGGGAACTGGTTCT CT GC CT GTTT CGAC AACTTCACAGAAGCTC TCGC TG AG AC AG CCT GTAGGCAGAT GG 

; m GCTACAGC AG AGCT GT GG AG ATTGGC CC AG AC CAGGATCTGG ATGTTGT T GAAATC AC AGAAAACAGC CAG 

lesf 

fBfc GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 



y 
if! 
0 
CO 

D 

rii 



TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
TCAGCATCCAGTACGACAAACAGCACGTCTGTGGAGGGAGCATCCTGGACCCCCACTGGGTCCTCACGGCA 



y 8 GCCCACTGCTTCAGGAAACATACCGATGTGTTCAACTGGAAGGTGCGGGCAGGCTCAGACAAACTGGGCAG 
41 CTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAAAGACAATGACA 
« TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 



GATGAGGAGCTCACTCCAGCCACCCCACTCTGGATCATTGGATGGGGCTTTACGAAGCAGAATGGAGGGAA 
GATGTCTG AC AT ACTGCT GC AGGC GT CAGT CC AGGT CATT GACAGC AC ACGGT G CAAT GC AGACGATGCGT 
ACCAGGGGGAAGTCACCGAGAAGATGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGT 



i% GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
£?| CTGCGGGGGCCCGAGC AC CC CAG GAGT ATACACCAAGGTCTC AG CCT AT CT C AACT GGAT CT ACAATGTCT 

GGAAGGCTGAGCTG TAA TGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCCAAAGTCAGACACAGAGCAAGAGTCCCCTTGGGTACACCCCTCTGCCCACAGCCTCAGCAT 
TTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCCTCGCAGCCCAGAGGCGCCCAGAGGAAGTCA 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
CAGGGGTATTGCTAAGCCAAGAAGGAACTTTCCCACACTACTGAATGGAAGCAGGCTGTCTTGTAAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GCCTACTAGAGCAAGAAACCAGTTGTAATATAAAATGCACTGCCCTACTGTTGGTATGACTACCGTTACCT 
ACTGTT GTCATTGT T ATT AC AG CT AT GGC CACTATTATTAAAGAGCT GTGT AAC AT CT CT GGCAAAAAAAA 
AAAA 
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FIGURE 112 



MLQDPDSDQPLNSLDVKPLRKPRI PMETFRKVG I PI 1 I ALLSLAS 1 1 1 VWL I KV 1 LDKY Y FLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDVVEITENSQELRMRNSSGPCLSGSLVSLHCIjACGKSL 
KT PRWGGEEAS VPS WPWQVS I QYDKQHVCGG S I LDPHWVL TAAHC FRKHTDVFNWKVRAG S DKL 
GS FPSLAVAKII I IEFNPMYPKDNDIALMKLQFPLTFSGTYRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMSDILLQASVQVIDSTRCWADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWG I VSWGYGCGGP S T PGVYTKVS AYLNW I YNVWKAE L 

Transmembrane domain; 



£tj amino acids 32-53 (typell) 

CI 
If! 
W 

m 

kit 
. fh 

4i 



in 
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FIGURE lis 



GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGGCTCTGAACAACTTTTTCAGCAACTAAAAMGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GG C CC AAAC C TGAC CT TCACTCT GGAACGAGAACAGAGG T T T C T ACC CAC AC CG T C CC C T C GAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
M AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 

n 

J£| GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
5l| ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
| s y CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 
fll GAACGCCGGAGGAAAGCAAAG TGG CAGGGAAGGAACT TG T GC CAAAT TATGGGTCAGAAAAGAT G 

GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 



f?^ CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
Ijl TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
CI CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 



SO 

o 
111 



GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
C T C AAT T TAAAT CATGT T C T AGT AAT T GGAGC T G T CCCCAAGACCAAAGGAG C TAGAGC TT GG T T 
CAAATGATCTCCAAGGGGCCTTATACCCCAGGAGACTTTGATTTGAATTTGAAACCCCAAATCCA 
AAC C TAAGAACC AGGT GCAT T AAGAAT CAG T TATT GC C GGGTGTGGTGGCC T GTAATGC CAACAT 
TT T GGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGT TCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
C CAGC TAC T CGGGAGGC TGAGAC AGGAGAAT TAG T T GAACC T GGGAGGT GAAGGAGGC T GAGAC A 
GG AGAAT CAC T T CAGC CT GAG C AAC AC AGCGAGAC T C T G T C T C AGAAAAAAT AAAAAAAG AAT T A 
TGGTTATTTGTAA 
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FIGURE lid 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 



If! 
U 

m 



if! 
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FIGURE 115 



CAGCAGTGGTCTCTCAGTCCTCTCAAAGCaAGGAAAGAGTACTGTGTGCTGAGAGACCATGGCAA 
AG AAT C C T C C AG AG AAT T G T G AAG AC T G T CACAT T C T AAAT GC AGAAG C T T T T AAAT C CAAG AAA 
AT ATGTAAATCAC T TAAGAT T TGT GGACT GGT GTT T GGTAT CC T GGC CC TAAC T C TAAT T G T CC T 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
T C T ACAGCAAT GGAG AGAAGAAGAAG AT T TACATGGAAAT T GATCC T G T GACCAGAAC TG AAAT A 
T T C AGAAG CGGAAATGGCACT GATGAAACAT T GGAAGT GC ACGAC T T T AAAAACGGATACAC T GG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
C T G AAC C AGAAG AGG AAAT AG AT GAG AAT GAAGAAAT T AC C AC AAC T T T C T T T GAAC AG T CAG T G 
F~ ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA 
GATTTGTGATAACGTGACCATGTATTGGATCAATCCCACTCTAATATCAGTTTCTGAGTTACAAG 
AC T T TGAGGAGG AGGGAGAAGAT C TT CACTTTCC TGCCAAC GAAAAAAAAGGGAT TGAACAAAAT 
f 5 i| GAACAG TGGG TGG T CC C T CAAGT GAAAG TAGAGAAGACCCGT CACGCCAGACAAGCAAG T GAGGA 

01 AGAACTTCCAATAAATGACTATACTGAAAATGGAATAGAATTTGATCCCATGCTGGATGAGAGAG 
: '* l f GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 



GGCTACTACCCA.TATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
T AACT GGT GGGTGGCCC GCATGC T GGGGAGGG T C TAA TAGGAGGT T T GAG CT CAAATGC T T AAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 



in. 

t^l GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
W TTATCACCAAAAAAAAAAAAAAAAAAA 

m 

m ■ 
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FIGURE 116 



m 



rii 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKKTGYTGIYFVGLQKCFIKTQIKVIP 
EFSE PEEE IDENEE I TTT FFEQSVIVJVPAEKP IENRDFLKNSKI LE ICDNVTMYWTNP TL I SVSE 
LQDFEEEGEDLHFPMEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCC 1 YCRRGNRYCRRVCE PLLGYYP YPYC YQGGRVI CRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 



( sp jj amino acids 1-40 

m 

tor Transmembrane domain: 

m 
Id 

m 



amino acids 25-47 (type II) 



N-glycosylation sites. 



bp amino acids 94-97, 180-183 



m 242 



Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 



N-myristoylation sites, 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 



Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATC ATG TCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGT GT TCC AGT AC GAAGGGCTCT GGAGGAGCTGCGTGAGGC AGAGTTC AGGCTTCACCGAAT GC AGGCC C 
TATT TC ACCATCCT GGGACTTCCAGC CAT GCT GCAGGCAGTG CGAG CCCT GATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CC AAAGC C AACATGAC AC TGACCT C C GGGATC ATGT TC AT TGTCTC AGGT CT T TGT GCAATT GCT GGAGT G 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAaACCAACTACAAA 
GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
f|| TGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATC 
y| ' CTTCCAAGCACGACTATGTGTAATGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCA 
CCCAAAAAACAAGGAGATCCCATCTAGATTTCTTCTTGCTTTTGACTCACAGCTGGAAGTTAGAAAAGCCT 



HA CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 

i fl 

%¥ GCTGAGTT AT TT AT GAAT T AGAGGCT AT AGCTC ACATT T TC AATCCTCT ATT T CTTTTTT T AAAT ATAACT 

n TTCTACTCTGATGAGAGAATGTGGTTTT AATCTCTCTCTCAC ATTTTGAT GATTTAGACAGACT CC CCCT C 

TTCCTCCTAGTCAATAAACCCATT GATGATCT ATTT CCCAGCTTATCCCCAAGAAAACTT TTGAAAGGAAA 
GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 
ACACTTACTGAAGAAGAAGCAATAAGAGAAAGATATTTGTAATCTCTCCAGCCCATGATCTCGGTTTTCTT 



if! 
o 
m 



ACAC TGTGAT CTTAAAAGTT ACCAAACCAAAGTCAT TT T C AG TTTG AG GC AACC AAACCT TT CT ACTGCT G 
ffj TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTC AGAAAT T GTC CCTAGATGAAT GAGAAAATT AT TT TTTT TAAT TT AAGT CCTAAAT ATAGTTAA 
AATAAAT AATGTT T TAGT AAAATGAT AC ACTATCTC TGTGAAAT AG CC TC AC C CCT ACAT GT GGATAG AAG 
GAAATGAAAAAATAATTGCTTTGACATTGTCTATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCC 
AT GAAAAGCT CACACCTGT AAT CC TAGC ACTT TGGG AGGCT G AGGAGGAAGGAT C ACTTGAGCC CAGAAGT 
TC GAGACT AGCCT GGGCAAC ATGGAGAAGCCCTGT CTC T AC AAAATACAGAGAGAAAAAATC AGCC AGT CA 
TGGT GGC ATACACCTGT AGTC CCAGCAT T CCG GGAGGC T GAG GTGGGAGGAT C ACT TGAGCCC AGGGAGGT 
TGGGGCTGCAGTGAGCCATGATCACACCACTGCACTCCAGCCAGGTGACATAGCGAGATCCTGTCTAAAAA 
AATAAAAAAT AAAT AAT GGAACACAGCAAGTC CTAGGAAGT AGGTT AAAACT AATTCTT T AA 
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FIGURE 118 



£3 



MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLPAMLQAVRALMIVGIVLGAIGLLVSIFALKCIRIGSMEDSAKA1SMTLTSGIMFIVSGL 
CAIAGVSVFANMLVTNFWMS TANMYTGMGGMVQT VQTRYT FGAAL FVGWVAG GLTLI GGVMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGn<ASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 

V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 



m amino acids 81-100, 121-141, 173-194 

m 



IJI 

Ms* 
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FIGURE HQ 



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAf^CAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGGTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
ACT CAAT AGTGAATG T T GC CCAAAAACGT GAGCTT GGAGAAGCT CTCTACTTAGGAT GGAC CAC GGCA 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
CAGATAC T C GAT ACC T TCCC ATC GC ACAACCC AAAAAAGTTAT CACACC GGAAAGAAGT C ACC GAGC G 
TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
ACAAAA^TCTATATTACTTTCTCA?\AATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
AAT CTTAAT T AC AGGAACT GT GCAT CAGC TAT T TAT GAT T C TATAAGC T AT TTCAGCAGAATGAG AT A 
TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 



Q ACGAC AT AG CAT TAT G TACAT AGAT GAG T GT AACAT T TAT AT CT C AC ATAGAGACAT GC T T ATAT GGT 



l/f, TTT ATTT AAAAT GAAATGCCAGT C CAT TACACT GAATAAATAGAACT CAACTATT GCTTT TCAGGGAA 

h* AT CAT GGATAGGGTTGAAGAAGGT TACTATT AATTGT TTAAAAACAGCT T AGGGATTAATGTCCT CCA 

■JZJ T T T ATAAT GAAGATTAAAA.T GAAGGCT T T AAT C AGCAT T G TAAAG GAAAT T GAAT GGCT T T CTGATAT 

m 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
T T C CAGGGC TATACTCAGAAGAAAGATAAAAGT GT GATCTAAGAAAAAGTGAT GGTTTTAGGAAAGT G 
AAAATATTTTTGTTTTTGTATTTGAAGAAGAATGATGCATTTTGACAAGAAATCATATATGTATGGAT 
ATAT T T T AATAAG TAT T T GAGTAC AGAC TTT G AGGT T T CAT CAAT AT AAAT AAAAGAGC AGAAAAAT A 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
T AT ACAT T TAT AT TAATAAATTGT ACATT TTT C TAAT T 
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FIGURE 120 

MATHALEIAGLFLGGVGMVGTVAVTVMPQW 

IYDSLLALSPDLQAARGLMCAASVMS FLAFMMAI LGMKCTRCTGDNEKVKAH ILLTAGI I FI I TG 
MWLIPVSWVANAIIRDFYNSIVNVAQKRSLGEALYLGWTTALVLIVGGALFCCVFCCNEKSSSY 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 32-101, 118-145, 164-188 
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FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GATTCAAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGCTTTGAGGAGTCCTGGACACCCAACTAC 
AAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCTTGGGAAAATTGCGGAGTGTACATT 
\ ■. TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
p GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
0 AT T GAAGC TAT AAT T TAT T TGGACCAAGGAAGC C C T GAAAT GAAT TCAACAAT TAATAT T CATC G 

M ? ; CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
T T GGCACT T GTT CAGAT T ACC CAAAAGGAGATGC T T C T AC T GGATGGAAT T CAG T T T CTCGCAT C 
ATTATTGAAGAACTACCAAA ATAAA TGCTTTAATTTTCATTTGCTACCTCTTTTTTTATTATGCC 
TTGGAATGGTTCACT TAAATGACATTT TAAATAAGT TT AT GTATACATCTGAATGAAAAGCAAAG 
C T AAAT ATGT T T AC AGAC CAAAGT GT GAT T T C AC AC T GT T T T T AAAT C T AGC AT TAT T CAT T T TG 
CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAATATAAAAGCTACCAATCTTTGTACAATTTGTAAATGTTAAGAATTTTTTTTATATCTGT 
T AAATAAAAAT TAT T T CC AAC A 



w 



Iff 

a 



m 
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FIGURE 122 
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MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGSKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVL FS GS LRLKCRNACCQRW Y FT FNGAECSGPLP ISAI I YLDQGS PEMN S T INI HRTS 
SVEGLCEGI GAGLVDVAI WVGTCSDYPKGDAS TGWNSVSRI I IEELPK 

Signal peptide: 

amino acids 1-30 



\& Transmembrane domain: 

h*f amino acids 195-217 

13 
W 

S! 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCM 
ATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAGAAATTGTAGGATCCAGTTTTTTTTTTA 
AC C GCCC CC TCCCCACCCCCCAAAAAAACTGTAAAGAT GCAAAAACGTAAT AT CCATGAAGAT C C 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
T CT TCC T GGGT GT CAGC GAGC CCT GAC T CAC TACAGT GCAGC T GACAGGGGC T G TCATGCAAC T G 
GCCCCT AAGCCAAAGCAAAAGAC CTAAGGACGACC T T TGAACAATACAAAGGATGGGT T T CAAT G 
T AAT TAGGCTACTGAGCGGAT CAGC TGTAGCACT GGT TAT AGCCCCCACT GT CT TACTGACAAT G 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
AT CT CAGAAAT T ACAGGAGATAC CCT CAAG TATAT CT GC T GGT T GCT TAGGT T TGTCCCTTCGCT 
ATAACAGCCTTCAAAAACTTAAGTATAATCAATTTAAAGGGCTCAACCAGCTCACCTGGCTATAC 
CT T GACCATAAC C ATAT C AGCAATATT GACGAAAAT GC T T TTAAT GGAAT ACGC AG ACT CAAAGA 
GCTGATTCTTAGTTCCAATAGAATCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATT 
TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
AGAC TGCCGCAACC TGGAAC T T T TGGACCTGGGATATAACCGGATCCGAAGT T T AGCC AGGAAT G 
T C T T T GC TGGCAT GATC AGAC TCAAAGAAC T TC ACCTGGAGC ACAAT CAAT T T T CC AAGC T C AAC 
f f \ CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
CATAGGACAGACCATGTCCTGGACCTGGAGCTCCTTACAAAGGCTTGATTTATCAGGCAATGAGA 
tfl TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
kEj T CC AACAAGC T CAC AT T TAT T GGT CAAG AG AT T T T G GAT T C T T GG AT AT C C C T CAAT GAC AT CAG 

TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
T TAAAGGTCTAAGGGAGAATACAAT TATCTGTGCCAGT CCCAAAGAGCT GCAAGGAGTAAAT GT G 
AT C GATGCAG TGAAGAAC T ACAGCATCT GTGGCAAAAGT AC TACAGAGAGGT TTGATCTGGC CAG 
Ifl GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
y! TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
||| TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
S CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGC AC AGGAAAAAGAAAAGACAGT C CCTAAAGCAAAT GAC TCCCAGC AC CC AGGAATT TT AT GT A 
il}. GATTATAAACCCACCAACACGGAGACCAGCGAGATGCTGCTGAATGGGACGGGACCCTGCACCTA 
TAACAAATCGGGCTCCAGGGAGT GT GAGGTATGAAC CAT TGTGATAAAAAGAGC TCTTAAAAGC T 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGT CAT T T TCCT CT CATACATAATCAACCCAT TGAAAT T T AAATAC CACAATCAATGTGAAGCT T 
GAACT CCGGT T T AATATAAT AC C TAT T GT ATAAGAC C C T T TAC TGAT TC CAT T AAT GTC GC AT T T 
GT T T T AAGATAAAACT T C T T T CATAGGTAAAAAAAAAAA 



rt 



: : 



h4 



App_ID-10063699 



Page 277 of 322 



e 



0 



ry 



FIGURE 124 



MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LS LRYNSLQKLKYNQ FKGLNQLT WLYLDHNH I SNI DENAFNGI RRLKEL I LS SNR I S YFLNNT FR 
PVTNLRNLDL S YNQLHS LGS EQ FRGLRKLLSLHLRSNSLRT I PVRI FQDCKNLELLDLG YNR IRS 
LARNVFAGMIRLKELHLEHNQFSKLNLALFPRLYSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKS FKGLRENT 1 1 CAS PKELQGWVI DAVKNYS I CGKS T TERFDLARALPKPTFKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 



011 Signal peptide; 

\M amino acids 1-33 



Transmembrane domain: 

amino acids 420-442 

h 

Is? I N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 



oAMP- and aGMP- dependent protein kinase phosphorylation site. 

amino acids 4 65-4 68 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 



CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT T G AAGAGGAAGGCT T TCT AAAGC T T T GGCAAGGAG T G ACACC CGC CAT T T AC AGACAC G T AGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGMATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
T TAGCCAATCCAACTGACC TAGTGAAGGTTCAGATGCAAATGGAAGGAAAAAGGAAACTGGAAGG 
t , AAAACCATTGCGATTTCGTGGTGTACATCATGCATTTGCAAAAATCTTAGCTGAAGGAGGAATAC 
Q GAGGGCTTTGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGATTTA 
y| ACCACTTATGATACAGTGAAACACTACTTGGTATTGAATACACCACTTGAGGACAATATCATGAC 
W TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 



m 

. Ft 

CI 
in 

m 



TCAAAAGCAGAATAAT GAATCAACC ACGAGATAAAC AAGGAAGGGGACT T T TGTATAAAT CATC G 
ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
T GAG TGGAGTCAGT CCAT T T TAA 
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FIGURE 126 



MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALG I IEEEGFLKLWQGVT PAI YRHWYS GGRMVTYEHLREWFGKS EDEH YPLWKS 
VIGGMft&GVIGQFLANPTDLV^ 

N I QRAALV1SJMGDLTTYDTVKHYLVLNTPLEDNIMTHGLS S LCSGLVAS I LGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 



Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 

w 

Is?! 

m 

7 , 3 

W 

If! 
ill 

Si 

a 

ill 

i 
US 

a 

fl! 
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FIGURE 127 



CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAATGAAGAGGACATCCTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGAC AT GT T C AA_T GTGAACGTGCT GGC CCTCAGCAT CT G CACACGGGAAG C C TAC CAG T CCATGA 
AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
W GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 

m 



la 

m 



AGACACAAT TCGCC T T CAAACT CCACGACAAGGACCCT GAGAAGGCAGCT GCCACCTAT GAGCAA 
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CATCCAGATTGGAGACATCCAGATGAGGCCCACGGAGCAGGTGACCTAGTGACTGTGGGAGCTCC 
hB TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
■ § CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 
f5| TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
Sii TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
fr| TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
C| TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
M AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
C AGAAC AC CAG GGC C T GGCCCAGT GG AT T T CAT GG T GAT CAT T AAAAAAG AAAAAT C G CAAC CAA 
AAAAAAAAAA 
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FIGURE 128 

MARPGMERWRDRLALVTGAS GGI GAAVARAIjVQQGLKVVGCARTVGN I EELAAECKSAGYPGTL I 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINN^^ 

EAYQSMKERWDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISP G WE T Q FAFKLHDKD PE KAAAT YE QMKCLKPEDVAEAVI Y VL S T PAH I Q I GD I QMR P TE QVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 120 



AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTGCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAA TGAG GAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
T TAAT T TT TAAC T GATAGT T G TACATAT T TGGGGGT ACAT GT GAT AT T T G GATACAT G TATACAA 
TATATAATGATCAAAT CAGGGTAAC TGGGATATCCATCAGATCAAAC AT TT ATTT TT TAT TC TT T 
1?^ T TAGACAGAG TC TC AC T C TGTCAC C C AGGC TGGAG T GC AGT GGT GC C AT C T C AGC T TAC T GCAAC 

13 CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
Zt! GCACCACAATGC CCAACTAAT T T T T G T ATTT TTAG T AGAGACGGGGT TT TGCCATG T TGC CCAGG 

01. 

|.| CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
g| GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 



W ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 

II! 



«,» AAC CT CT GATCACC TC AT T C TAC TC TC TAC C TCCAT GAGATCCAC T T TTTT AGCTCCCACAT G T G 

AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
|3 TT CCATCCATGTTGCTGCAAATGACAGGAT TTCGTT CT TAAT T TCAATTAAAATAACCACACATG 

HI GCAAAAA 

O 
fll 
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FIGURE ISO 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 



Important features of the protein; 
Signal peptide: 

amino acids 1-18 



rl 

n 

m 
m 



n 

n 

f ?? 

CI 



N-myristoylation site. 

amino acids 8 6-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE 131 



TTCT GAAGTAACGGAAGCT ACCTT GT ATAAAGACCT CAACACTGCTGACCATGATCAGCGCAGCCTGG AGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAATACCAGAGGATGCTACAACTCTCTACCTTCAGAACAACCAAATAAATAATGCTGGGATTCCTTCAGAT 
TT GAAAAACTTGCT GAAAGTAG AAAGAATATACCTATACC AC AACAGT TT AGAT GAATTT CCTACCAACCT 
CCCAAAGTATGTAAAAGAGTTACATTTGCAAGAAAATAACATAAGGACTATCACTTATGATTCACTTTCAA 
AAATTCCCTATCTGGAAGAATTAGATTTAGATGACAACTCTGTCTCTGCAGTTAGCATAGAAGAGGGAGCA 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
GCCCAGGACTATAGAAGAACTACGCTTGGATGATAATCGCATATCCACTATTTCATCACCATCTCTTCAAG 
1 . GTCTCACTAGTCTAAAACGCCTGGTTCTAGATGG AAACCTGTTGAACAAT CAT GGTTTAGGTGACAAAGT T 

'm TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
P| TCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT 
Ifl CT T ATCTAAGGCAGCT CT ATCGACTGGATATG TCCAAT AATAACCT AAGT AAT T T ACCTCAGGGT ATCT T T 

h| GATGATTTGGACAATATAACACAACTGATTCTTCGCAACAATCCCTGGTATTGCGGGTGCAAGATGAAATG 



GGTACGTGACTGGT TAC AAT C ACT ACCTGT GAAGGT CAACGTGCGTGGGC TC ATGTGCCAAGCCCCAGAAA 
AGGTTCGTGGGATGGC T ATT AAGGAT CT C AAT GC AGAACTGTTTG ATT GT AAGG AC AGTGGGATT GTAAGC 
ACCATTCAGATAACCACTGCAATACCCAACACAGTGTATCCTGCCCAAGGACAGTGGCCAGCTCCAGTGAC 

! ps CAAACAGCCAGATATTAAGAACCCCAAGCTCACTAAGGATCAACAAACCACAGGGAGTCCCTCAAGAAAAA 

£3 

CAATTACAATTACT GT GAAGTCTGTCACCTCTGATACCATTCATATCTCTTGGAAACTTGCTCTACCT ATG 



m 
m 



ACTGCTTTGAGACTCAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGT 
AACAGGGG AACG CAGT GAGTACTTGGT CAC AGCCCT GGAGCCTG ATTC ACCCT ATAAAGT AT GCATGGT T C 
p| CCATGGAAACCAGCAACCTCTACCTATTTGATGAAACTCCTGTTTGTATTGAGACTGAAACTGCACCCCTT 
f|| CGAATGT ACAACCC TAC AACCACCCTCAAT CGAGAGCAAGAGAAAGAACC T T ACAAAAAC CCC AAT TT ACC 

TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TTCATAGGAATGGATCGCTCTTCTCAAGGAACTGTGCATATAGCAAAGGGAGGAGAAGAAAGGATGACTAT 
GCAGAAGCTGGCACTAAGAAGGACAACTCTATCCTGGAAATCAGGGAAACTTCTTTTCAGATGTTACCAAT 
AAGCAATGAACCCAT CTC GAAGGAGGAGTT TGTAAT ACACACCATATT TC C T CCT AATGGAAT GAATCT GT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
TCACACTCATGATGCTGAAGGACTCACAGCAGACTTGTGTTTTGGGTTTTTTAAACCTAAGGGAGGTGATG 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSGPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVSAVSIEEGAFRDSNYLRLLFLSRNHLSTIPWGLPRTIEELRLDDNRISTISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNKLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEPCVRGMAIKDLMAELFDCKDSGIVST IQI TTAI PNTVYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGSPSRKTITITVKSVTSDTIHISWKLALPMTALRLSWLKLGHSPAFGSITET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
5 , PYKNPNLPLAAI IGGAVALVTIALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 

f% ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myri s toy la ti on sites . 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 



m 

W. 
m 

m 
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FIGURE 133 



CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
T GCAGAGGCAG T CT GGGCTTGGCCAGAGCTCAGGGT GCTGAGCGT GT GACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
y% CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 
W GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 

M CCCAGGAC TCAT TC TC AGAGGAGAGAAGT GAC GT GT GCCT GGTGCAGCTGC T GGGAACCGGGAC G 

01 

ft/s GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 

M* 

ff! AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 

«?- 

feO GACCACT CCAACAGAGC CAGGAC TATATGAACCT CT TC TGCGC CAAC AT GATGGACT TGAACCGC 

W AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGATGCTTCGGGGAGCCTGATGCTGAAGATGAAGAATTATCTAAAGCTATTCAA 
TATCAGCAGCATTTTTCGAGGAGAGTGAAGAGGCGAGAAAAACAATTTCCAGATTCTCGCTCTGT 
£9 TGCTCAGGCTGGAGTAGAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACT T T CAAAGAGACTGTGTTGAATAAAGGGCCAAGGT TCT TGCCACCCAGCACT C 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GT GGC T TCC TAT AC AT C CTGGC AGAAT ACCCCCC AGCAAACAGAGAGCCACACC CAT CCACACC G 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
T T T AGT CC T CAT C C C T T AGATC C T G GAG GGCAC G GATC AC AT C C TGGGAAG AAGGC AT C T GGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 



5 

i y 
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FIGURE 134 

MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRS1MTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANl^DLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
S S WD YRS VP P YLANFY I FLVET GFHHVAHAGLELL I SRDPP TS GS QS VGL 



Important features of the protein: 
Signal peptide: 



CI amino acids 1-2 6 

9! 
m 

m 

t.n amino acids 39-56 



Transmembrane domain: 



S Tyrosine kinase phosphorylation sites. 

f*; amino acids 149-156, 274-282 

.Ml 

gj N-myristoylation sites. 

P amino acids 10-16, 20-26, 63-69, 208-214 

III 

Amida ti on si te . 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
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FIGURE las 



5s 5 



GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CT CAGGAC T GGATC TCGGCGGCCCGAGT GC T GGT AGAC G GAGAAGAGCACGTCGGTT TCC T T AAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCTTACAGATTTGATCCCGTTCGAGTGGATATCACTTCGAAAGGAAAAATGAGAGCAAGATATG 
TGAATTACATCAAAACATCAGAGGTTGTCAGACTGCCCTATCCTCTCCAAATGAAATCTTCAGGT 
CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
CTGACATGAGACGGGAAATGGAGCAGTCAATGAATATGCTGAATTGCAACCATGAGTTGCCTGAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAA.TCATCTGGCAAATCTAGCAGCGGCAGCAG 
|f| TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 
ill AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT 

m 

GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
n GAGGT CAATATT GAT GTCAC TGAAT TAAT TACAG T GT CC T AT AGAAAAT GC CAT TAATAAAT TAT 

IJI AT GAAC T ACT AT ACAT TATGTAT AT TAAT TAAAACATC T TAAT C C AGAAATCAAAAAAAAAAAAA 

yl AAAAAAAAAAAAAAA 

tn 
m 

m 
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FIGURE 136 



MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVG FLKT DG S FWHD IPS GSYWEWS P AYR F D P VRVD ITS KGKMRAR YVNY I KT S E 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFLMNF 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 



u 

m 



Important features of the protein: 
Signal sequence : 

amino acids 1-23 



p| Transmembrane domain: 



amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites. 



P amino acids 37-40, 236-239 

y? 

wit 

VI 

5% cAMP- and cGMP-dependent protein kinase phosphorylation site. 

:p% amino acids 151-154 

ry 

N-myristoylation sites. 

amino acids 33-33, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 



ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 



AppJD=l 0063699 



Page 290 of 322 



ill 



FIGURE 137 



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAG C TACGTTGGC T T T CT GGAAGGGGAGGCTATATGCGT CAAT T CC CCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CT T CT C TC TG CT G C G T T T TATC TC CTAT GG AC TCCT T CCAC TGGACT GAAGACACTCAAT T T G GG 
AAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCTGAGATACGGGGCAGTG 
T GCAAGCCAAAGATGGAAACAT T GACATCAGAATCT TAAGGAGGACT GAGT CTTTGCAAGACACA 
Li AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
f e sl AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
Q CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 



m 

U 

.11 

||| CT GCT AAGAATAT T CGAGG T CAAGAG CTCCAG TC T T CAAT ACC TGCAGAGGAGG CAT GAC C C CAA 

iff ■ ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 



AAGAAATACAGCCAGATTCTGAGTCACTTTGAAAAGCTGGAACCTCAGGCAGCAGTTGTGAAGGC 
T T TGGGGGAACT AGACAT TC TTC TGCAATGGATGGAGGAGACAGAATAGGAGGAAAGTGATGC TG 



CTTCCTTG CAT GAT T GTC T T TAT GC AT C C C CAAT C T T AAT T GAGACC AT AC T T G T ATAAGAT T T T 
TGT AATATCTT T C TG CTAT TGGATATAT TTAT TAGT TAAT ATATT TAT TTAT TT T TTG CTAT T TA 



M 

if! 

AT GT AT T TAT T TT T T T AC TTG GACAT GAAAC T T TAAAAAAAT T CACAGAT T AT AT T TATAAC C T G 
|p ACTAGAGC AGGT GATGTATT TT T ATACAGTAAAAAAAAAAAACCTT GTAAATTCTAGAAGAGTGG 

p CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
ATTTGAAATTGAACCAATGACTACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCAC 
AT CT AC CT T ACAAT TAC T GAC CATC C C CAG TAGACT CCCC AGT CCCATAAT T GTGT AT CT T CCAG 
C CAGGAATCC T ACACGGCCAGCAT GTAT TTC TAC AAAT AAAGT T T TC T T TGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 



MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 

S SREAWQFFLLLW SPDFRPKMKAS S LAFS L LS AAFYLLWTP S TGLKT LNLGS CVI ATNLQE IRNG 

FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 

SIJ^SFLTIKKDLRLSHAHMTCHCGEEAMK^^ 

E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 




N-myristoylation sites. 



cAMP- and cGMP-dependent protein kinase phosphorylation sites. 



amino acids 192-195, 225-223 



amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGA.TGATTTTGTGTTTG 
GG T CAAAGGG TGT GAAAT T TAT GCC CT ACACAACT T AC CT TGTGGAAAAAGGAGCAT C T C AC AGT 
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGCT GGAAAAAGAATAGTCT TT TATGGAGA.TGAAACCTGGGTTAAATTA 
TTCCCAAAGCATTTTGTGGAATATGATGGAACAACCTCATTTTTCGTGTCAGATTACACAGAGGT 
GGATAATAATGTCACGAGGCATTTGGATAAAGTATTAAAAAGAGGAGATTGGGACATATTAATCC 
TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
AAGCTGAGCGAGAT GGACAGCGT GC TGATGAAGATCCACACCTCACT GCAGTCGAAGGAGAGAGA 
GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 



CI CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 
E| ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 



CAATGAGAGAGCAGTTGAGATTTTTACATTTGAATACAGTGCAGCTTAGTAAACTGTTGCAAGAG 
AAT GT GCCGT CATATGAAAAAGAT C C T GGG T T TGAGCAGT T T AAAAT G T CAGAAAGAT T GCAT GG 
GAACTGGATCAGACTGTACTTGGAGGAAAAGCATTCAGAAGTCCTATTCAACCTGGGCTCCAAGG 
TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
| ? P' TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
^ CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
2. GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
CI AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
t!| TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
Q CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
O GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
f|| TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
AC T CAGAT CCAC AGAGCCCAGGATCAAGGGACCCAC TGCAG TGGCAGCAGGACTGTT GGGCCCC C 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CTGCACACAGTATGTAGTTACCAAAAGAATAAACGGCAATAATTGAGAAAAAAAA 
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FIGURE 14Q 



MRLGS GT FATCCVAI EVLG I AVFLRG FFPAPVRS S ARAEHGAE P PAP E P SAGAS SNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGVKmPYTTYLVEKGASHSFVAEAKPPTVTMPRIKAMTGSLPGF 
VDVI RNLNS PALLEDSVI RQAKA&GKRI VFYGDETWVKLFPKHFVE YDGTT S FFVS DYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSET G SHGAS S TE E VNT PL I L I S SAFERKPGD I RHPKHVQ 



Lj... 



m 



Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 



fcs> amino acids 58-76 

m 

Jil 

N-glycosylation sites . 



amino acids 56-60, 194-198 



N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 



a 
m 

M 275-281, 278-284 

m 

|3 Amidation site. 

fll amino acids 154-158 



Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 



0 

PI 

m 



\ ■ 

m 
a 

m 



rii 



GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAACTGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGAAGAACTTAAAC 
CCGAAGAAAT TCAGCAT TCATGACCAGGATCACAAAGT ACTGGTCCT GGACTCTGGGAAT CT CAT 
AGCAGTTCCAGATAAAAACTACATACGCCCAGAGATCTTCTTTGCATTAGCCTCATCCTTGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
T G T GAC AAGGAT AAAGGACAAAGT CAT CCATCCCT T CAGC TGAAGAAG GAGAAAC TGATGAAGC T 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GT T GGGGTGAC AGATAAAT T TGAGAAC AGGAAACACAT T GAATTTTCAT T T CAAC CAGT T TGCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 
AT T TGAAC T AATTGTATAAAAACAC CAAACCTGC TCAC T 
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FIGURE 142 

MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGMLIAVPDKNYIRPEI 
FFALAS SLSSAS AEKGS P I LL GVS KGEFCL YCDKDKGQSHPS LQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESiyUiPGWFICTSCNCNEPYGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 



N-myristoylation site* 

amino acids 50-55, 87-92 

Interleukin-1 



m 

■*£ amino acids 37-182 

m 

# 



m 

? n 

rn 
ci 

rn 
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FIGURE 143 



CTAGAGAGTATAGGGCAGAAGGATGGCAGATGAGTGACTCCACATCCAGAGCTGCCTCCCTTTAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGTTTGCTGTCCACACATTCAACCAACAGAGCAAGGACTACTATGCCTACAGACTGGGG 
CACAT C T TGAATT CCTG GAAGGAG CAGG TG GAGTCC AAGAC T G TAT T CT CAAT GGAGC TACT GC T 
GGGGAGAACTAGGT GT GGGAAA.T T T GAAGACGACAT TGACAACTGCCAT T T CCAAGAAAGCACAG 
fa AGCTGAAC AATACT T T C AC CTGCTTCTT CAC CATCAGC AC CAGGCC CTGG AT GACT CAGT T CAGC 

13 CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
fci GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 



TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 



m 
U 

m ACACAT CT TGAGCC TAATCATGT AGTGTAGATCAT T AAACAT CAGCAT T T TAAGAAAAAAAAAAA 

g| AAAAAAAAAAAAAAAAA&AAAAAAAAAAAAAAAAAA^^ 

5 

m 

y 
III 
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FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHMVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMT QFS LLNKT CLEG FH 



Important features of the protein: 
Signal peptide: 

amino acids 1-25 



e 

ri 



m 

m 
m 

m 



Vr* 

la 

ru 



N-glycosylation sites. 

amino acids 117-121, 139-143 



\m N-myristoylation site. 



amino acids 9-15 
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FIGURE 145 



CTGTGCAGCTCGAGGCTCCAGAGGCACACTCCAGAGAGAGCCAAGGTTCTGACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CT GAGG CC CAGGT GGC TGAGAACCGCC CGGGAGCCT T C ATCAAGCAAGGCCGCAAG C T CGACAT T 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 



||| TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 

|y CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 

m 

m GAGAAGAACACATCAGGCACTGCGC CACCTGC T TCACAG T AC T TCCCAACAACT CTTAGAGGTAG 

'%l GTGTATTCCCGTTTTACAGATAAGGAAACTGAGGCCCAGAGAGCTGAAGTACTGCACCCAGCATC 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 

12 

p| GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 

If! C T C TA&T GAAAT T GT GAAAGC T CCAT GT T T AGAAAT AAAT GAAAACAC C TGA 

0 

S9 ' 
0 
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FIGURE 146 



MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LDIDFGASGNRYYEANYWQFPDGIHYNGGSEANVTPCEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 




N-glycosylation sites . 



amino acids 98-102, 110-114 



Tyrosine kinase phosphorylation site. 



amino acids 76-83 



0 
if! 
Q 
HI 



Arnidation site. 



N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 



fll 



amino acids 62-66 
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FIGURE 147 



GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTC ATG TTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTTTCAGATTCAGGGGGTACATGTGAAGGTTTGTTTTATGAGTATATTGCATGATGCTGAGG 
TTTGGGGT 



a 
m 

k'A 

m 

m 

0 
m 

0 

m 

m 
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FIGURE 148 



MFRS SLLFWPPLCLLSLFLL I LI S S I YS ES CKLE I FH FACQWGRSLSLS FYFLKFQLS DSGGTCE 
GLFYEYIA 

Important features or* the protein: 
Signal peptide: 

amino acids 1-25 



0 
if! 

m 

n 



N-myristoylation site. 

amino acids 62-68 
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FIGURE 1AQ 



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTAC AGCTCCAGGAGCCCAGCGCCGGGCT GT GAC CCAAGC C GAGC GTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCTACATAATAGAGAATTAAGTGCAGAAAGACCTTTGAATGAACAGATTGCTGAA 
G CAGAAGAAGACAAGATTAAAAAAACATATCC T CCAGAAAACAAGCCAGGT CAGAGCAACTATT CTTT 
TGTTGATAACTTGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAATCTA 
T AAGAAGCTCC CCACT T GAT AATAAGTTGAATGTGGAAGAT GT T GAT T CAAC CAAGAATC GAAAACTG 
ATC GATGATTATGACTCTACTAAGAGT GGATTGGATCATAAATT T CAAG ATGATCCAGAT GGTCTTCA 
T CAAC T AGACGGG ACT CCT T T AAC CGCT GAAGAC AT T GT C C AT AAAAT C GC T G CC AGGAT T TAT GAAG 

P! 

tZ AAAAT GACAGAGCCGT GTTT GACAAGAT T GT T T C TAAAC TACT T AAT CTCGGCCT T ATCACAGAAAG C 

CAAGCACATACACTGGAAGATGAAGTAGCAGAGGT TTTACAAAAAT TAAT C TCAAAGGAAGCCAACAA 
TTATGAGGAGGATCCCAATAAGCCCACAAGCTGGACTGAGAATCAGGCTGGAAAAATACCAGAGAAAG 
p| TGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACAGTATCTAACACA 
S||J TTAACCTTGACAAATGGCTTGGAAAGGAGAACTAAAACCTACAGTGAAGACAACTTTGAGGAACTCCA 
# ATATTTC CCAAATTTC TATGCGCTACT GAAAAGTATTGAT TCAGAAAAAGAAGCAAAAGAGAAAGAAA 

% C AC T GAT TACT AT CAT GAAAACAC TGATT GACT TT GT GAAGATGAT GGT GAAATATGGAACAAT ATCT 

n 

CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 
AGAJ^AAASATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
pi CAGAC AG TACC AAGGAAGAAGCAGCTAAGAT G GAAAAGGAAT AT G GAAGCT T GAAG GAT T CC AC AAAA 

P| GAT GATAACTCC^CC CAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTAT T TGGAAGC 

f|| CAT CAGAAAAAATATT GAAT GGTT GAAGAAACATGACAAAAAGG GAAATAAAGAAGATTATGAC CTTT 

CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAA^WIAAAAAAAAAAAA 



m 
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FIGURE ISO 

MG FLGT GTW I LVLVLP I QAFPKP GGS QDKS LHNRELS AERP LNEQ I AEAEE DK I KKT Y PPENKPG 
QS NYS FVDNLNLLKAI TEKEKI EKERQS I RS S PLDNKLNVEDVDS TKNRKL I DD YDS TKS GLDHK 
FQDDPDGLBQLDGT PLTAE D I VHKI AARI YEENDRAVFDKI VSKLLNLGL I TE S QAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAKEKETLITIMKTLIDFVKMMVKYGTISPEEGV 
SYLENLDEMTALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

I N-myristoylation sites: 

J amino acids 143-148, 239-244 

1 
n 

% 
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FIGURE 151 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGCTTCTAGCTGGAGGGCTGCATGCAGGGAAGGTCATTAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGAC TAG GGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACC CATGGCGGACAAT CAC TCTCTCTGC T CTCAGGACC CCCACGTCT GACT TAG 
T GGGCACC T GACCACT T TGTCTTCTGGTTCC CAG T T TGGATAAAT T C TGAGAT T T GGAGC T CAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
GAAAT GGC T CGAGC TCAGAAGATAAAAGATAAGT AGGGT ATGCTGATCCT CTTTTAAAAACC CAA 
f] GATACAATCAAAATCCCAGATGCTGGTCTCTATTCCCATGAAAAAGTGCTCATGACATATTGAGA 
\M AGACC TAC T TACAAAGT GGCATATAT T GCAATT TATT TTAATTAAAAGATACC TAT T TATATATT 

P| TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
J% AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
n f % ATTCCTTG T AT AAAAAT AAGAAAAGAAAT T AAT C T T GAG G T AAGCAGAGC AG AC AT CAT C T C T GA 

TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
ii TTGTAGTAGTGATCAGGAAACAGATCTCAGCAAAGCCACTGAGGAGGAGGCTGTGCTGAGTTTGT 
P| GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
j'fl GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
Jg~ ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
%f>l GTCTCTGCAGA.TGTAGTTAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAATTCAATAT 
III! GACT GG T T T C C T T GT ATGAAAAGGAGAG GAC AC AGAGAC AG AGGAGAC G C GG GGAAGAC TAT G T A 

P AAGATGAAGGCAGAGATCGGAGTTTTGCAGCCACAAGCTAAGAAACACCAAGGATTGTGGCAACC 
ATCAGAAGCTTGGAAGAGGCAAAGAAGAATTCTTCCCTAGAGGCTTTAGAGGGATAACGGCTCTG 
CTGAAACCTTAATCTCAGACTTCCAGCCTCCTGAACGAAGAAAGAATAAATTTCGGCTGTTTTAA 
GCC AC CAAGGATAAT TGGT TACAGC AGC T C TAGG AAAC TAAT ACAGC T GC T AAAAT GATC CCT GT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGTGATGGCATGCCACTTCCAAGATTAGGTTATAAAAGACACTGCAGCTTC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGC CTATAAAGAGACT TACGTGGTAAAAAATGAAGT CT CCT 
GCCCACAGCCACATTAGTGAACCTAGAAGCAGAGACTCTGTGAGATAATCGATGTTTGTTGTTTT 
AAGT TGCT CAGT T T TGGT CTAAC T TG T TATGCAGCAATAGATAAATAAT AT GCAGAGAAAGAG 



m 



m 
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FIGURE 1*2 

1WLSGALCFRMKDSALKVLYLHNNQLLAGGLHAGKVIKGEEISWPNRWLDASLSPVILGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDKGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 



m 
Ml 

m 

m. 
o 

m 
cr 



N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-l signature. 

amino acids 111-131 

Interleukin-1 proteins. 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE 15 3 



CTTCAGAA.CAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCAATGGCC6C 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
T T C CAGCAGCCCT AT AT CACCAACCGCACC T TCAT GC T GGC TAAG GAGGCT AGC T TGGC T GAT AA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TAT TGAAGGTGATGAC CTGCATAT C CAGAGGAATGT G CAAAAGC TGAAGGACACAGTGAAAAAGC 
T T GGAGAGAG TGGAGAGAT C AAAGCAAT T GGAGAAC T GGAT T T GC TGT T T ATGTCTCT GAGAAAT 
J,,j, GC C TGCAT TTGACC AGAGC AAAGCT GAAAAAT GAAT AACT AACCC C C TT TC CCTGGTAGAAATAA 

Q CAATT AGAT GCCCCAAAGCGAT T T TT T T T AA.C CAAAAGGAAGAT GGGAAGCCAAAC TCCATCAT G 

Q ATGGGTGGATTCCAAATGAACCCCTGCGT.TAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 

f I AGACCAGAAGGTAGACTTTCTAAGCATAGATATTTATTGATAACATTTCATTGTAACTGGTGTTC 

ifj 

T AT ACACAGAAAAC AAT T TAT T T T T T AAAT AAT TGTCTTTTTC C AT AAAAAAGAT T AC T T T CCAT 

kh 

g% T CCT T TAGGGGAAAAAACCCC TAAATAGCTTCATGT TT CCATAAT CAGT AC TTTATAT T TAT AAA 

&|| T GT AT TT ATTATTATT ATAAGACTGCAT T T TAT T TAT AT CAT TT TATTAATATGGATTTAT TTAT 

* AGAAACATCATTCGATATTGCTACTTGAGTGTAAGGCTAATATTGATATTTATGACAATAATTAT 

m 

fZ AGAG C T AT AAC AT G T T TAT T T GAC C T C AATAAACAC T T G GAT AT C C C 

iifl 

m 

m 

m 

Pfi 
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FIGURE izj. 

MAALQKSVSSFLMGTLATSCLLLLALLVQGGAAAPISSHCRLDKSNFQQPYITNRTmi^EASL 
ADHNTDVRL IGEICL FHGVSMSERCYLMKQVLNFTLEEVL FPQS DRFQP YMQEWP FLARLSNRLS 
TCHIEGDDLHI QRNVQKLKDT VKKLGE S GE I KA I GE LDLL FMS LRNAC I 



Important features of the protein: 
Signal peptide: 

amino acids 1-33 



ri 

I'll 



N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 



N-myristoylation sites. 

amino acids 14-20, 82-88 

m 

k£j ProJtaryotic membrane lipoprotein lipid attachment site, 

amino acids 10-21 

% . 

0 

If! 



App_ID=10063699 



Page 308 



FIGURE 155 



GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGC CT GAGGGCT GT GCTGAGAGGGAGAGGAGCAGAGAT GCTGC TGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACC CAGATTAGG TGAGGACAGT TCTCTCAT TAGCCT TT TCCTACAGGTGGT TGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
s . TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
ft GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 

JfBPS? 

Q AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
.fcM GTGATGGGC TAG CCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGCT TTGGGGAAAACCT GCACTTCTGCACATTTTGAAAAGAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
« GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
t3 GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 



41 



TGTTTGTTXACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
T T G AC AT G GAT GAT T C T GAGG AG GAAGC T G T TAT T GAAT GTATAGAGAT T T AT C CAAA TAAATAT 



m. 

m CTTTATTTAAAA7VTGAAAAA 
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FIGURE 1*6 

MRERPRLGEDSSLISLFLQWAFLAMWGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLE 

RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 

ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 



Important features of the protein: 
Signal peptide: 

amino acids 1-32 



N-glycosylation site. 

jwh amino acids 136-140 



m 
hi 

m 



m 

m 



Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-rnyristoylation sites. 



W. amino acids 44-50, 150-156 
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FIGURE 157 



CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGeCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACTTGAGGGACCTCCGAGTAGAACCTGTTACAACTAGTGTTGCAACAGGGGACTATTCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATTTGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGCTGTGTGAGGTGCAATTACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCT GAACACAG T C TAT T TCAT T GGGG C CCATAAT AT T C C TAAT GCAAAT ATGAATGAAGATGG 
CCCTTCCATGTCTGTGAATTTCACCTCACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAA 
AGTGTGTCAAGGCCGGAAGCCTGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACA 
GT AGAAGTGAAC TT CACAAC CACTCCCC T G GGAAAC AGAT AC ATGGC T C TT AT C GAACAC AGCAC 
TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
W TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 

r ! AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 

ill 

s&\ GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
i.f\ CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
till TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
* TCACACAATTTGTTACTTCACTGAATTTCTTCAAAACCATTGCAGAAGTGAGGTCATCCTTGAAA 
||j AGTGGCAGAAAAAGASAATAGCAGAGATGGGTCCAGTGCAGTGGCTTGCCACTCAAAA.GAAGGCA 
M GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
||| GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
£jj GTGATCTAAGAAGCCAGATTCATCTGCACAAATACGTGGTGGTCTACTTTAGAGAGATTGATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 



Hi 
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FIGURE 1*8 

MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 

LRADASIRLLKATKICVTGKSNFQSysCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 

NANMNE DGP SMS VNFT S PGC^DH IMKYKKKCVKAGSLW DPNI TACKKNEETVE VN FTTTPLGNRYMALIQH 

ST I IGFSQVFEPHQKKQTRASWI PVTGDSEGAT VQLTPY FPTCGSDC IRHKGTWLCPQTGVP FPLDNNK 

SKPGGWLPLLLLSLLVATWVLV^ 

QNHCRSEVILEKWQKKKIAEMGPVQWIA^ 

FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 



8! 
W 
HI 



o 

to 



Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 



N-myristoylation site. 

amino acids 116-122 



Amidation site. 

amino acids 488-452 
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FIGURE 15Q 



jlf! 

m 

m 



m 
d 
ru 



AGCCACCAGCGCAACATGACAGT GAAGACCCTGCAT GGCC CAGCCAT GGTCAAGTACTT GCT GCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 
AC T CAGC TGAAGAAG 
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FIGURE 160 



MTVKTLHGPAKVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRN 1 5 SRS TS P WN YT VT WDPNRYPS E WQAQCRMLGC INAQGKE D I SMNS VP I QQE T L W 
RRKHQGCS VS FQLE KVLVT VGC T CVT PVI HHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 




N-myristoylation sites. 



amino acids 106-111, 136-141 



id 

in 



% 
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FIGURE 161 



ACACT GGCGAAAGAAAAACGAAAGC AC T CCGTGCT GGAAGTAGGAGGAGAGTCAGGAC T CCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCC TGAAGATGAGGAAAAGT TT GGAGGAGCAG C TGAC TCAGGGGT GGAGGAGCC T AGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
f»| CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
p| CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
lM TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
f|| AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
HI CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
9 CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
|f| CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
H| CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
*j[ CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
fcl CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
|!| CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
f3 GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
f|| GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
^ GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
C GGGAC T TAAAT AAAGGCAGACGC TGT T T TT CTAAAAAAA 



IP 
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FIGURE 162 



■5 

fit! 



£5 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQ KETDCDLCLRVAVHIAVHGHWEEPEDEEKFGGAADS GVEE P RNASLQAQWL S FQAYPTARCVLLEV 
QVPAALVQ FGQSVGSWY DC FEAALGSE VRIW SYTQPRY E KELNHTQQLPAL PWLN VSADGDNVHL VLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKYIHKRWALWLACLLFAAALSLILLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGVWLLFSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 



$S% amino acids 1-20 



Transmembrane domain. 

amino acids 453-475 



N-glycosylation sites. 

amino acids 118-121, 
334-337, 357-360, 391-394 



n amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 

m 
m 

I 



Glycosarninoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



m 



GGGAGGGCTCTGTGCCAGCCCCGATCAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCCTGACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGTCTACAGCATCGAGTATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
T TC TACCAC T TAGAGCT CC AGGT CAACCGCAC CT AC CAAAT GCACC T T GGAGGGAAGC AGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
j?w AG T C T GGCCCAGCCTGTCCAGTACTCCCAGAT CAG GG T G T CTGGACCCAGGGAGCCCGCAGGAGC 

f! TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 
CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
\M CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
p| CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 
Tf| TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
CCCTGGGGATTTGCACAGACAGAACATCTGACCCAAATGTGCTACACAGTGGGGAGGAAGGGACA 
■W CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
f? CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
Q TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
||| GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
jjJJ AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
M ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
||| AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
p AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAACTCCCGAGGTCCAGAGCCCCACCTC 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGC^lACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCCT TT TCTGCAGGCAGGAGTTTCAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 
T C T T T AAAGTGCT T AATAGT GCCTGGTACATGGGCAG T G C CCAATAAACGGTAGC TATT TAAAAA 
AAAAAAAA 



ni 
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FIGURE 164 



MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRI TRKS CNLTVETGNLTE LYYARVTAVSAGGRSATKMTDRFSSLQHTTLKP PDVTC I S 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQWRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYS FSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPS YGVCMEGSGKDS PTGTLS SPKHLRPKGQLQKEP PAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 



id 

P*. Transmembrane domain. 



amino acids 233-250 



- N-glycosylation sites. 

CI amino acids 80-83, 87-90, 172-175 

iff 

Cl 

N-myristoylation sites. 

HI 

»i* amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 165 



TGGCCTACTGGAMAAAAAAAAAAAAA^ 

CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGGAT 
GT T AATGTACCGT GG GAAAGC TC T T GAAGACT TCAC GGG C C C T GAT T GTCGTTTTGT GAAT T T T A 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGTGTTGAACACAGTTTTGGATATTTTCCAAAAGATTTGATCAAGGTACTTCATAAATACACGGA 
AG AAGAGC T AC AT AT T C C AGC AG AT G AG AC AGAC TTTGTCTGCTTT GAAG GAGGAAGAGAT GAT T 
T TAATAGT TATAAT G T AGAAG AG C T T T TAGGAT C T T T GG AAC T GGAGG ACT CT GT ACC T GAAG AG 
TCGAAGAAAGCTGAAGAAGTTTCTCAGCACAGAGAGAAATCTCCTGAGGAGTCTCGGGGGCGTGA 
\A ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
S CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 

m 

5? CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 

PI 

/ b ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 

iM 

p| AGACAGATGCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGTGGACAGTGCGTTATT 
CATTACAGCAAAGGAT T TCGT T GGCAT CAAAATCTAAGTT TGTTT TACAAAGATTGTTTT TAG TA 
m CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
l c AAAAAAAAAAAAAAAAAAAA 



m 
'Q 
ft! 
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FIGURE 166 



ill 



MAAAPGLLFWLFVLGALWWVPGQSDLSHGRRFSDLCTCGDEECSMIJyrYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHS FGYFPKDLIKVLHKYTEEELHIPADETDFV"CFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCYIHYSKGFRWHQNLSLFYKDCF 

Important features of* the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-7 6 



I 

0 N-rayristoylation sites. 

amino acids 205-211, 225-231, 277-283 



Ainidation site. 

amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GT CCAGCCGAGT GAG C T TTCAGGAGGCACGCC TGGC T T GT GAGAG T GAGG GAGGAGT C CT CC T CA 
GCCT T GAGAAT GAAGCAGAACAGAAG T TAATAGAGAG CATG T TGCAAAACC T GACAAAAC CC GGG 
ACAGGGAT T T CT GATGG TGAT TT CT GGATAGGGC T TT GGAG GAAT GGAGAT GGGCAAACATC T GG 
T G C C T GCC CAGAT CT C TACCAGT GGTC TGATGGAAGCAAT T CCCAG T AC CGAAACTGG TACACAG 
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
CTTGGGGGTCCCTACCTTTACCAGTGGT^ATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
pi C AAG TAT GAACCAGAGAT TAAT CCAACAGCCCCT GT AGAAAAGC C T T AT CT TACAAATCAAC CAG 

GAGACACCCATCAGAATGTGGT T G TTACTGAAGCAGG T AT AATT CC CAATC TAAT T TAT GT T G T T 
^jf ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
Of GCATAAAAGTAAAGGAAGAACAAAAAC TAG TCCAAACC AGTCTACAC T GTGGAT T T CAAAGAGT A 

C CAGAAAAGAAAGT GGCAT GGAAG T ATAATAACTCAT T GAC TTGGT T C CAGAAT T T TG TAAT T CT 
GGAT CTGTATAAGGAATGGCATCAGAACAATAGC TTGGAATGGCTTGAAATCACAAAGGATCTGC 
AAGAT GAACTG TAAGC TCCCCCT T GAGGCAAATAT T AAAGTAAT T TT TA.TAT GTC TAT TATT T CA 
T T TAAAGAATATGCTGTGCTAATAATGGAGTGAGACAT GCT TATT TT GC TAAAGGATGCACCCAA 
■W AC T TCAAAC T T CAAG CAAATGAAAT G GACAATGCAGAT AAAGT T G TT AT CAACACG T C GGGAG T A 

g TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA 
ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
AAACT TAT TACT G T TGTCAAC TGAAT T CACACACACACAAATATAGT ACCATAGAAAAAGTT T GT 
fol TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
III AATAAGAAGC TAT T T CAT TAAGT GT GATATAAAC C T C C TCAAACATT T T AC T TAGAGGCAAGGAT 

f'1 T G T C TAAT T TCAAT T GT GCAAGACAT GTGCC T T AT AAT TAT T T T T AGC T TAAAAT T AAACAGAT T 

TTGTAATAATGTAACTTTGTTAATAGGTGCATAAACACTAATGCAGTCAATTTGAACAAAAGAAG 
T G AC ATAC AC AAT AT AAAT CAT AT GT C T T C AC AC G T T G C C TAT AT AAT GAG AAG C AGC T C T C T GA 

GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
AT T GAC AC T G GAGG CAGAT AG T T GC AAAGT TAGT CT AAG G T T T C CC T AGC T G TA T T TAGC C T C T G 
AC T ATAT TAG TATACAAAGAGGT CATGTGG TT GAGAC CAGG TGAATAG T CAC TATCAG TGT GGAG 
ACAAGCACAGCACACAGACATTT TAGGAAGGAAAGGAACTACGAAATCGTGT GAAAATGGGT TGG 
AACCCAT CAG TGAT CGCATAT T CAT TGAT GAGGG T T T GCT T GAGATAGAAAAT G GT GGC T CCT T T 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 

MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 

GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQT3GACPDLYQWSDGSNSQ 

YRI^YTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 

PYLTNQPGDTHQNVWTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 
STLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 



amino acids 214-235 

N-glycosylation sites. 

III amino acids 86-89, 255-258 



cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 266-2 69 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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